Abstract


We study the satisfiability of ordering constraint satisfaction problems (CSPs) above average. We
prove the conjecture of Gutin, van Iersel, Mnich, and Yeo that the satisfiability above average of ordering
CSPs of arity k is fixed-parameter tractable for every k. Previously, this was only known for k = 2 and
k = 3. We also generalize this result to more general classes of CSPs, including CSPs with predicates
defined by linear inequalities.

To obtain our results, we prove a new Bonami-type inequality for the Efron–Stein decomposition.
The inequality applies to functions defined on arbitrary product probability spaces. In contrast to other
variants of the Bonami Inequality, it does not depend on the mass of the smallest atom in the probability
space. We believe that this inequality is of independent interest.

1 Introduction 

In this paper, we study the satisfiability of ordering constraint satisfaction problems (CSPs) above the
average value. An ordering $k$-CSP is defined by a set of variables $V = \{x_1, \dots, x_n\}$ and a set of
constraints $\Pi$. Each constraint $\pi \in \Pi$ is a disjunction of clauses of the form
$x_{\tau_1} < x_{\tau_2} < \dots < x_{\tau_k}$ for some distinct variables $x_{\tau_1}, \dots, x_{\tau_k}$
from a $k$-element subset $V_\pi \subset V$. A linear ordering $\alpha$ of variables $x_1, \dots, x_n$ satisfies
a constraint $\pi$ if one of the clauses in the disjunction agrees with the linear ordering $\alpha$. The goal
is to find an assignment $\alpha$ that maximizes the number of satisfied constraints.

A classical example of an ordering CSP is the Maximum Acyclic Subgraph problem, in which constraints
are of the form “$x_i < x_j$” (the problem has arity 2). Another well-known example is the Betweenness
problem, in which constraints are of the form “$(x_i < x_j < x_k)$ or $(x_k < x_j < x_i)$” (the
problem has arity 3). Both problems are NP-hard and cannot be solved exactly in polynomial time unless
P = NP [?] (see also [?, 9, 26]).

There is a trivial approximation algorithm for ordering CSPs as well as other constraint satisfaction
problems: output a random linear ordering of variables $x_1, \dots, x_n$ (chosen uniformly among all $n!$
linear orderings). For instance, if each constraint is just a clause on $k$ variables, this algorithm satisfies
each clause with probability $1/k!$ and thus satisfies a $1/k!$ fraction of all constraints in expectation.
In 1997, Håstad [20] showed that for some regular (i.e., non-ordering) constraint satisfaction problems the
best approximation algorithm is the random assignment algorithm. His work raised the following question:
for which CSPs are there non-trivial approximation algorithms and for which CSPs is the best approximation
algorithm the random assignment? This question has been extensively studied in the literature. Today, there
are many known classes of constraint satisfaction problems that do not admit non-trivial approximations
assuming the Unique Games or P ≠ NP conjectures (see e.g. [20, ?, 14, 5]). There are also many constraint
satisfaction problems for which we know non-trivial approximation algorithms. Surprisingly, the situation is
very different for ordering CSPs: Guruswami, Håstad, Manokaran, Raghavendra, and Charikar [13] showed that
no ordering $k$-CSP admits a non-trivial approximation assuming the Unique Games Conjecture.

A similar question has been actively studied from the fixed-parameter tractability perspective* [1, ?, 15, 23]:
Given an instance of a CSP, can we decide whether OPT > AVG + t for a fixed parameter
t? Here, OPT is the value of the optimal solution for the instance, and AVG is the expected value on a
random assignment. In 2011, Alon, Gutin, Kim, Szeider, and Yeo [1] gave an affirmative answer to this
question for all (regular, non-ordering) k-CSPs with a constant size alphabet. In [15, ?], Gutin et al.
extended this result to 2-arity and 3-arity ordering CSPs, and conjectured [?] that the satisfiability above
average is fixed-parameter tractable for ordering CSPs of any arity k. Below we state the problem formally.

Definition 1 (Satisfiability of Ordering CSP Above Average). Consider an instance $\mathcal{I}$ of arity k and a
parameter t. Let $OPT = OPT(\mathcal{I})$ be the number of constraints satisfied by the optimal solution, and
$AVG = AVG(\mathcal{I})$ be the number of constraints satisfied in expectation by a random solution. We need to
decide whether OPT > AVG + t.

Definition 2. A problem with a parameter t is fixed-parameter tractable if there exists an algorithm for the
problem with running time $g(t)\,\mathrm{poly}(n)$, where $g(t)$ is an arbitrary function of t, poly is a fixed
polynomial (independent of t), and n is the size of the input.

In this paper, we prove the conjecture of Gutin et al. [?] and show that the satisfiability above average
of any ordering CSP of any arity k is fixed-parameter tractable.

Theorem 3. There exists a deterministic algorithm that, given an instance $\mathcal{I}$ of an ordering k-CSP on n
variables and a parameter t, decides whether $OPT(\mathcal{I}) > AVG(\mathcal{I}) + t$ in time
$g(t)\,\mathrm{poly}_k(n)$, where g is a function of t and $\mathrm{poly}_k$ is a polynomial of n (g and
$\mathrm{poly}_k$ depend on k). If $OPT(\mathcal{I}) > AVG(\mathcal{I}) + t$, then the algorithm also outputs an
assignment satisfying at least $AVG(\mathcal{I}) + t$ constraints.

Furthermore, we prove that the problem has a kernel of size $O_k(t^2)$.

Techniques. Let us examine approaches used previously for ordering CSPs. The algorithms of Gutin et
al. [15, ?] work by applying a carefully chosen set of reduction rules to ordering CSPs of arity 2 and
3. These rules heavily depend on the structure of 2-CSPs and 3-CSPs. Unfortunately, the structure of ordering
CSPs of higher arities is substantially more complex. Here is a quote from [?]: “it appears technically
very difficult to extend results obtained for arities r = 2 and 3 to r > 3.” In this paper, we do not use such
reductions.

The papers [15, 25] use an alternative approach to get an advantage over the random assignment for
special families of ordering CSPs. They first reduce the ordering k-CSP to a regular k-CSP with a constant
size alphabet, and then work with the obtained regular k-CSP. However, this reduction, generally, does not
preserve the value of the CSP. So if for the original ordering CSP instance $\mathcal{I}$ we have
$OPT(\mathcal{I}) > AVG(\mathcal{I}) + t$, then for the new instance $\mathcal{I}'$ we may have
$OPT(\mathcal{I}') \ll AVG(\mathcal{I}) + t$ (we note that $AVG(\mathcal{I}') = AVG(\mathcal{I})$).
In this paper, we do not use this reduction either.

Instead, we treat all ordering CSPs as CSPs with a continuous domain: Our goal is to arrange all variables
on the interval $[-1, 1]$ so as to maximize the number of satisfied constraints. The arrangement of variables
uniquely determines their order. Moreover, if we independently assign random values from $[-1, 1]$ to
variables $x_i$, then the induced ordering on the $x_i$'s will be uniformly distributed among all $n!$ possible
orderings. Thus, our reduction preserves the values of OPT and AVG. However, we can no longer apply the Fourier
analytic tools used previously in [?, 15]. We cannot use the (standard) Fourier analysis on $[-1, 1]^n$,
since we have no control over the Fourier coefficients of the functions we need to analyze. Instead, we work
with the Efron–Stein decomposition [?] (see Sections 2 and 3.3). We show that all terms in the Efron–Stein
decomposition have a special form. We use this fact to prove that an ordering k-CSP that depends on
many variables must have a large variance. Specifically, we show that if a k-CSP instance depends on $C_k t^2$
variables, then the standard deviation of its value from the mean (on a random assignment) is greater than
$c_k t$ (for some constants $C_k$ and $c_k$). As is, this claim does not imply that OPT > AVG + t since for some
assignments the value may be substantially less than AVG − t. To finish the proof of the main result, we
prove a new hypercontractive inequality, which is an analog of the Bonami Lemma [4]. This inequality is
one of the main technical contributions of our paper.

* We refer the reader to an excellent survey of results in this area by Gutin and Yeo [?].

Theorem 4 (Bonami Lemma for Efron–Stein Decomposition). Consider $f \in L_2(\Omega^n, \mu^n)$. Let
$f = \sum_S f_S$ be the Efron–Stein decomposition of f. Denote the degree of the decomposition by d. Assume
that for every set S,

\[ \|f_S\|_4^4 = \mathbb{E}[f_S^4] \le C \|f_S\|_2^4 = C\, \mathbb{E}[f_S^2]^2. \tag{1} \]

Then

\[ \|f\|_4^4 = \mathbb{E}[f(X_1, \dots, X_n)^4] \le 81^d C \|f\|_2^4 = 81^d C\, \mathbb{E}[f(X_1, \dots, X_n)^2]^2. \tag{2} \]

We note that hypercontractive inequalities have been extensively studied under various settings (see
e.g., [?, 10, 32, 27]). However, all of them depend on the mass of the smallest atom in the probability
space. In our case, the smallest atom is polynomially small in n, which is why we cannot apply known
hypercontractive inequalities. This is also the reason why we need an extra condition (1) on the function
f. Condition (1) is a “local” condition in the sense that all expectations in (1) are over at most d variables
for every set S. Consequently, as we will see below, it is very easy to verify that it holds in many cases (in
contrast to (2), which is very difficult to verify directly). Note also that condition (1) is necessary: if it is
not satisfied, then the ratio $\|f\|_4 / \|f\|_2$ can be arbitrarily large even for d = 1.

Extensions. Once we assume that the domain of every variable is the interval $[-1, 1]$, we might be tempted
to write more complex constraints than before such as “the average of $x_1$, $x_2$ and $x_3$ is at most $x_4$”, or
“$x_1$ lies to the left of the midpoint between $x_2$ and $x_3$”, or “$x_1$ is closer to $x_2$ than to $x_3$”. Each of
these constraints can be written as a system of linear inequalities or a disjunction of clauses, each of which is
given by a system of linear inequalities. For instance, “$x_1$ lies to the left of the midpoint between $x_2$ and
$x_3$” can be written as $2x_1 - x_2 - x_3 < 0$. In Section 9, we extend our results to CSPs in which every
constraint is a disjunction of clauses, each of which is a “small” linear program (LP). Namely, each constraint
should have arity at most k, only variables that a constraint depends on should appear in the LPs that define it,
and all LP coefficients must be integers in the range $\{-b, \dots, b\}$ (for a fixed b). We call this new class of
CSPs (k, b)-LP CSPs.

Definition 5. A (k, b)-LP CSP is defined by a set of variables $V = \{x_1, \dots, x_n\}$ taking values in the
interval $[-1, 1]$ and a set of constraints $\Pi$. Each constraint $\pi \in \Pi$ is a disjunction of clauses of the
form $Ax < c$, where A is a matrix with integer coefficients in the range $[-b, b]$; c is a vector with integer
coefficients in the range $[-b, b]$; the indices of non-zero columns of the matrix A lie in a set $V_\pi$ of size k
(the set $V_\pi$ is the same for all clauses in $\pi$). The goal is to assign distinct real values to variables
$x_i$ so as to maximize the number of satisfied constraints.

In fact, we extend our results to a much more general class of valued CSPs: all CSPs whose predicates
lie in a lattice of functions with some natural properties (see Sections 8 and 9 for details); but we believe
that the subclass of (k, b)-LP CSPs is the most natural example of CSPs in the class. Observe that every
ordering k-CSP is a (k, 1)-LP CSP since we can write each clause $x_1 < x_2 < \dots < x_k$ as the system of
linear inequalities $x_i - x_{i+1} < 0$ for $i \in \{1, \dots, k-1\}$. Similarly, every k-CSP on a finite domain
$\{1, \dots, d\}$ is equivalent to a (k, d)-LP CSP. The reduction works as follows: We break the interval
$[-1, 1]$ into d equal subintervals $((2j - d - 2)/d, (2j - d)/d)$ and map every value j to the j-th interval.
Then, we replace every condition $x_i = j$ with the condition $x_i \in ((2j - d - 2)/d, (2j - d)/d)$, which can be
written as $-d x_i < -(2j - d - 2)$ and $d x_i < 2j - d$.
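The finite-domain reduction can be sketched in a few lines of Python. This is a minimal illustration and not part of the paper: the helper names `subinterval` and `satisfies_encoding` are hypothetical, and exact rational arithmetic is used only to avoid rounding issues.

```python
from fractions import Fraction


def subinterval(j, d):
    """Open subinterval of [-1, 1] that represents the domain value j in {1, ..., d}."""
    return Fraction(2 * j - d - 2, d), Fraction(2 * j - d, d)


def satisfies_encoding(x, j, d):
    """Check the two linear inequalities that replace the condition x_i = j.

    -d * x < -(2j - d - 2)   and   d * x < 2j - d
    All coefficients and right-hand sides are integers in [-d, d].
    """
    return -d * x < -(2 * j - d - 2) and d * x < 2 * j - d


d = 4
intervals = [subinterval(j, d) for j in range(1, d + 1)]
# For d = 4 the intervals are (-1, -1/2), (-1/2, 0), (0, 1/2), (1/2, 1).
x = Fraction(1, 4)
# Domain values whose pair of inequalities is satisfied by x (exactly one for generic x).
encoded_values = [j for j in range(1, d + 1) if satisfies_encoding(x, j, d)]
```

In this example the point 1/4 lies in the third subinterval, so it encodes the domain value 3 and no other.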

Overview. In the next section, we give an informal overview of the proof. We formally define the problem
and describe the Efron–Stein decomposition in Section 3. Then, in Section 4, we prove several claims about
the Efron–Stein decomposition of ordering CSPs. We derive the main results (Theorem 3 and Theorem 8)
in Section 6. Finally, we prove the Bonami Lemma for the Efron–Stein decomposition in Section 7. We
generalize our results to all CSPs with a lattice structure in Section 8 and show that (k, b)-LP CSPs (as well
as more general “piecewise polynomial” CSPs) have a lattice structure in Section 9.

2 Proof Overview 

Our high-level approach is similar to that developed by Alon et al. [1] and Gutin et al. [?, 16]. As
in [?], we design an algorithm that given an instance $\mathcal{I}$ of an ordering CSP does the following:

• It either finds a kernel (another instance of the ordering CSP) $\mathcal{K}$ on $O(t^2)$ variables such that
$OPT(\mathcal{I}) = OPT(\mathcal{K})$ and $AVG(\mathcal{I}) = AVG(\mathcal{K})$. Then we can decide whether
OPT > AVG + t by trying out all possible permutations of variables that $\mathcal{K}$ depends on in time
$\exp(O(t^2 \log t))$.

• Or it certifies that $OPT(\mathcal{I}) > AVG(\mathcal{I}) + t$.

To this end, we show that either $\mathcal{I}$ depends on at most $O(t^2)$ variables or the variance of
$\mathrm{val}_{\mathcal{I}}(\alpha)$ is $\Omega(t^2)$ (where $\alpha$ is chosen uniformly at random). In the former
case, the restriction of $\mathcal{I}$ to the variables it depends on is the desired kernel of size $O(t^2)$. In the
latter case, OPT > AVG + t. Though our approach resembles that of [?] at the high level, we employ very
different techniques to prove our results.

We extensively use Fourier analysis and, specifically, the Efron–Stein decomposition. Fourier analysis
is a very powerful tool, which works especially well with product spaces. The space of feasible solutions of
an ordering CSP is not, however, a product space: it is a discrete space that consists of $n!$ linear orderings
of variables $x_1, \dots, x_n$. To overcome this problem, we define “continuous solutions” for an ordering CSP
(see Section 3.2). A solution is an assignment of real values in $[-1, 1]$ to variables $x_1, \dots, x_n$; that is,
it is a point in the product space $[-1, 1]^n$. Each continuous solution defines a combinatorial solution $\alpha$
in a natural way: $\alpha$ orders variables $x_1, \dots, x_n$ according to the values assigned to them (e.g., if we
assign values −0.5, −0.9 and 0.5 to $x_1$, $x_2$ and $x_3$ then $x_2 < x_1 < x_3$ according to $\alpha$). Thus we
get an optimization problem over the product space $[-1, 1]^n$. Denote by $\Phi : [-1, 1]^n \to \mathbb{R}$ its
objective function. We consider the Efron–Stein decomposition of $\Phi$: $\Phi = \sum_{S : |S| \le k} \Phi_S$ (see
Section 3.3). Here, informally, $\Phi_S$ is the part of $\Phi$ that depends on variables $x_i$ with $i \in S$. All
functions $\Phi_S$ are uncorrelated: $\mathbb{E}[\Phi_{S_1} \Phi_{S_2}] = 0$ for $S_1 \ne S_2$.
We show that each $\Phi_S$ is either identically equal to 0 or has variance greater than some positive number,
which depends only on k (see Section 4, Lemma 12). We now consider two cases.

I. In the first case, there are at most $O_k(t^2)$ terms $\Phi_S$ not equal to 0. Using that $\Phi_S$ depends only on
variables $x_i$ with $i \in S$ and that there are at most $O_k(t^2)$ sets S such that $\Phi_S \ne 0$, we get that
$\Phi$ depends on at most $O_k(t^2)$ variables and we are done.

II. In the second case, there are at least $\Omega_k(t^2)$ terms $\Phi_S$ not equal to 0. Since the variance of each
term is $\Omega_k(1)$ and all terms $\Phi_S$ are uncorrelated, the variance of $\Phi$ is at least $\Omega_k(t^2)$ (see
Theorem 13). Therefore, $\Phi$ deviates from $AVG = \mathbb{E}[\Phi]$ by at least $\sqrt{\mathrm{Var}\,\Phi} = \Omega(t)$.
We then show that $\Phi - \mathbb{E}\Phi$ satisfies the conditions of Theorem 4 (see Lemma 14) and the degree of
the decomposition is at most k. Thus, the ratio $\|\Phi - \mathbb{E}\Phi\|_4 / \|\Phi - \mathbb{E}\Phi\|_2$ is bounded by
$O_k(1)$. This implies by a theorem of Alon, Gutin, Kim, Szeider, and Yeo [1] (see Theorem 15 in Section 6) that
$\Pr(\Phi - \mathbb{E}\Phi > t) > 0$. Hence, OPT > AVG + t.

3 Preliminaries 

3.1 Ordering CSP 

Consider a set of variables $V = \{x_1, \dots, x_n\}$. An ordering constraint $\pi$ on a subset of variables
$x_{i_1}, \dots, x_{i_k}$ is a set of linear orderings of $x_{i_1}, \dots, x_{i_k}$. A linear ordering $\alpha$ of V
satisfies a constraint $\pi$ on $x_{i_1}, \dots, x_{i_k}$ if the restriction of $\alpha$ to $x_{i_1}, \dots, x_{i_k}$
is in $\pi$. We say that $\pi$ depends on variables $x_{i_1}, \dots, x_{i_k}$.

Definition 6. An instance $\mathcal{I}$ of an ordering constraint satisfaction problem consists of a set of variables
$V = \{x_1, \dots, x_n\}$ and a set of constraints $\Pi$; each constraint $\pi \in \Pi$ depends on some subset of
variables. A feasible solution to $\mathcal{I}$ is a linear ordering of variables $x_1, \dots, x_n$. The value
$\mathrm{val}(\alpha) = \mathrm{val}_{\mathcal{I}}(\alpha)$ of a solution $\alpha$ is the number of constraints in
$\Pi$ that $\alpha$ satisfies. The goal of the problem is to find a solution of maximum possible value.

We denote the value of the optimal solution by OPT:

\[ OPT = \max_{\alpha} \mathrm{val}(\alpha). \]

The average value AVG of an instance is the expected value of a solution chosen uniformly at random
among $n!$ feasible solutions:

\[ AVG = \mathbb{E}_{\alpha}[\mathrm{val}(\alpha)]. \]

We say that $\mathcal{I}$ has arity k if each constraint in $\mathcal{I}$ depends on at most k variables.

Definition 7. In the Satisfiability Above Average Problem, we are given an instance of an ordering CSP
of arity k and a parameter t. We need to decide if there is a solution $\alpha$ that satisfies at least AVG + t
constraints, or, in other words, if OPT > AVG + t.
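For very small instances, OPT and AVG can be computed directly from the definitions by enumerating all n! orderings. The following Python sketch does exactly that. The representation of a constraint as a pair (variables, set of allowed orderings) and the function names are illustrative assumptions, and this brute-force search is not the algorithm developed in this paper.

```python
from fractions import Fraction
from itertools import permutations


def value(ordering, constraints):
    """Number of constraints satisfied by `ordering` (variables listed smallest first)."""
    position = {var: rank for rank, var in enumerate(ordering)}
    satisfied = 0
    for variables, allowed in constraints:
        # Restriction of the ordering to the variables the constraint depends on.
        induced = tuple(sorted(variables, key=position.__getitem__))
        satisfied += induced in allowed
    return satisfied


def opt_and_avg(n, constraints):
    """Brute-force OPT and AVG over all n! linear orderings of variables 0..n-1."""
    values = [value(p, constraints) for p in permutations(range(n))]
    return max(values), Fraction(sum(values), len(values))


# Betweenness: variable 1 lies between variables 0 and 2.
betweenness = [((0, 1, 2), {(0, 1, 2), (2, 1, 0)})]
# Maximum Acyclic Subgraph on a directed 3-cycle: 0 < 1, 1 < 2, 2 < 0.
cycle = [((0, 1), {(0, 1)}), ((1, 2), {(1, 2)}), ((2, 0), {(2, 0)})]

opt_b, avg_b = opt_and_avg(3, betweenness)
opt_c, avg_c = opt_and_avg(3, cycle)
```

For the 3-cycle the gap OPT − AVG equals 1/2, so the answer to the above-average question is positive only for small values of t.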

In this paper, we show that this problem is fixed-parameter tractable. To this end, we design an algorithm
that either finds a kernel on $O(t^2)$ variables or certifies that OPT > AVG + t.

Theorem 8. There is an algorithm that, given an instance of an ordering CSP problem of arity k and a
parameter t, either finds a kernel on at most $\kappa_k t^2$ variables (where the constant $\kappa_k$ depends only
on k) or certifies that OPT > AVG + t. The algorithm runs in time $O_k(m + n)$, linear in the number of
constraints m and variables n (the coefficient in the O-notation depends on k).

3.2 Ordering CSPs over $[-1, 1]^n$

Consider an instance $\mathcal{I}$ of an ordering CSP on variables $x_1, \dots, x_n$. Let us say that a continuous
feasible solution to $\mathcal{I}$ is an assignment of distinct values $x_1, \dots, x_n \in [-1, 1]$ to variables
$x_1, \dots, x_n$. Each continuous solution $x_1, \dots, x_n$ defines an ordering $\alpha$ of variables $x_i$:
$x_a$ is less than $x_b$ with respect to $\alpha$ if and only if $x_a < x_b$. We define the value of a continuous
solution $x_1, \dots, x_n$ as the value of the corresponding solution (linear ordering) $\alpha$. We will denote the
value of solution $x_1, \dots, x_n$ by $\Phi(x_1, \dots, x_n)$.

Note that if we sample a continuous solution $x_1, \dots, x_n$ uniformly at random, by choosing values $x_i$
independently and uniformly from $[-1, 1]$, the corresponding solution $\alpha$ will be uniformly distributed among
$n!$ feasible solutions. Therefore,

\[ OPT = \max_{x_1, \dots, x_n} \Phi(x_1, \dots, x_n) \quad \text{and} \quad AVG = \mathbb{E}[\Phi(x_1, \dots, x_n)]. \]

Note that all $x_i$ are distinct a.s. and thus a random point in $[-1, 1]^n$ is a feasible continuous solution a.s.
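The passage from continuous solutions to orderings can be illustrated as follows. This is a small sketch with hypothetical helper names (`induced_ordering`, `phi`); constraints are represented as pairs (variables, set of allowed orderings), which is an assumption made for the example rather than notation from the paper.

```python
def induced_ordering(values):
    """Indices of the variables listed from the smallest assigned value to the largest."""
    if len(set(values)) != len(values):
        raise ValueError("a continuous feasible solution must use distinct values")
    return tuple(sorted(range(len(values)), key=lambda i: values[i]))


def phi(values, constraints):
    """Value of a continuous solution: number of constraints satisfied by its ordering."""
    rank = {var: r for r, var in enumerate(induced_ordering(values))}
    total = 0
    for variables, allowed in constraints:
        restricted = tuple(sorted(variables, key=lambda v: rank[v]))
        total += restricted in allowed
    return total


# Values -0.5, -0.9, 0.5 for x_1, x_2, x_3 (indices 0, 1, 2 here) give x_2 < x_1 < x_3.
solution = (-0.5, -0.9, 0.5)
ordering = induced_ordering(solution)
# Betweenness-type constraint "x_1 lies between x_2 and x_3".
constraints = [((0, 1, 2), {(1, 0, 2), (2, 0, 1)})]
solution_value = phi(solution, constraints)
```

Only the relative order of the assigned values matters, so any other point of $[-1, 1]^n$ with the same induced ordering has the same value.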

3.3 Efron–Stein Decomposition

The main technical tool in this paper is the Efron–Stein decomposition. We refer the reader to [?, Section
8.3] for a detailed description of the decomposition. Here, we just recall its definition and basic properties.

The Efron–Stein decomposition can be seen as a generalization of the Fourier expansion of Boolean
functions on the Hamming cube $\{\pm 1\}^n$. Consider the Fourier expansion of a function
$f : \{\pm 1\}^n \to \mathbb{R}$,

\[ f(x_1, \dots, x_n) = \sum_{S \subseteq \{1, \dots, n\}} \hat{f}_S \chi_S(x_1, \dots, x_n), \]

where $\hat{f}_S$ are Fourier coefficients of f. Informally, the Fourier expansion breaks f into pieces,
$\hat{f}_S \chi_S(x_1, \dots, x_n)$, each of which depends on its own set of variables: The term
$\hat{f}_S \chi_S(x_1, \dots, x_n)$ depends on variables $\{x_i : i \in S\}$ and no other variables.

The Efron–Stein decomposition is an analogue of the Fourier expansion for functions defined on arbitrary
product probability spaces. Consider a probability space $(\Omega, \mu)$ and the product probability space
$(\Omega^n, \mu^n)$. Let $f : \Omega^n \to \mathbb{R}$ be a function (random variable) on $\Omega^n$. Informally, the
Efron–Stein decomposition of f is the decomposition of f into the sum of functions $f_S$,

\[ f = \sum_{S \subseteq \{1, \dots, n\}} f_S, \]

in which $f_S : \Omega^n \to \mathbb{R}$ depends on variables $\{x_i : i \in S\}$.

We formally define the Efron–Stein decomposition as follows. Consider the space $L_2(\Omega^n, \mu^n)$ of
functions on $\Omega^n$ with bounded second moment. Note that
$L_2(\Omega^n, \mu^n) = \bigotimes_{i=1}^{n} L_2(\Omega, \mu)$. That is, every
$f \in L_2(\Omega^n, \mu^n)$ can be represented as

\[ f(x_1, \dots, x_n) = \sum_j f_1^{(j)}(x_1) \cdot f_2^{(j)}(x_2) \cdots f_n^{(j)}(x_n), \]

for some functions $f_i^{(j)} \in L_2(\Omega, \mu)$. Let $A_0 \subset L_2(\Omega, \mu)$ be the one-dimensional space
of constant functions on $\Omega$. Let $A_\perp \subset L_2(\Omega, \mu)$ be the orthogonal complement to $A_0$.
That is, $A_\perp$ is the space of functions $f \in L_2(\Omega, \mu)$ with $\mathbb{E}[f] = 0$. We have
$L_2(\Omega, \mu) = A_0 \oplus A_\perp$ and

\[ L_2(\Omega^n, \mu^n) = \bigotimes_{i=1}^{n} L_2(\Omega, \mu) = \bigotimes_{i=1}^{n} (A_0 \oplus A_\perp). \]
Expanding this decomposition, we get a representation of $L_2(\Omega^n, \mu^n)$ as the direct sum of $2^n$ spaces:

\[ L_2(\Omega^n, \mu^n) = \bigoplus_{S \subseteq \{1, \dots, n\}} V_S, \]

where $V_S$ is the closed linear span of the set of functions of the form $\prod_{i=1}^{n} f_i(x_i)$ where
$f_i \in A_\perp$ if $i \in S$, and $f_i \in A_0$ if $i \notin S$. Since functions in $A_0$ are constants, $V_S$
equals the closed linear span of the set of functions of the form $\prod_{i \in S} f_i(x_i)$ where
$f_i \in A_\perp$.

Consider a function $f \in L_2(\Omega^n, \mu^n)$. Let $f_S$ be the orthogonal projection of f onto $V_S$. Since the
linear spaces $V_S$ are orthogonal, we have

\[ f = \sum_{S \subseteq \{1, \dots, n\}} f_S. \]

We call this decomposition the Efron–Stein decomposition of f. We define the degree of f as
$\max\{|S| : f_S \ne 0\}$, the size of the largest subset S such that $f_S$ is not identically equal to 0 (we let
the degree of 0 be 0).

Let $(X_1, \dots, X_n)$ be a random element of $\Omega^n$. That is, $X_1, \dots, X_n$ are n independent random
elements of $\Omega$; each of them is distributed according to $\mu$. We write $f = f(X_1, \dots, X_n)$. We will
employ the following properties of the Efron–Stein decomposition (see [?, Section 8.3]).

1. $f_S(x_1, \dots, x_n)$ depends only on variables $x_i$ with $i \in S$.

2. For every two sets S and T, $S \ne T$, we have $\mathbb{E}[f_S f_T] = 0$.

3. Let $S_1, \dots, S_r$ be subsets of $\{1, \dots, n\}$. Suppose that there is an index j that belongs to exactly one
set $S_i$. Then $\mathbb{E}\bigl[\prod_{i=1}^{r} f_{S_i}\bigr] = 0$.

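We will also use the following equivalent and more explicit definition of the Efron–Stein decomposition.
For every subset S of indices $\{1, \dots, n\}$, let

\[ f_{\subseteq S} = \mathbb{E}[f(X_1, \dots, X_n) \mid \text{all } X_i \text{ with } i \in S], \tag{3} \]

\[ f_S = \sum_{T \subseteq S} (-1)^{|S \setminus T|} f_{\subseteq T}. \tag{4} \]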
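On a finite product space the explicit formulas can be evaluated by direct enumeration. The sketch below is an illustration under assumptions not made in the paper: $\Omega$ is a small finite set with the uniform measure, functions are stored as lookup tables, and the names `conditional_expectation` and `efron_stein` are hypothetical. It computes $f_{\subseteq S}$ by averaging over the coordinates outside S and then obtains $f_S$ from the inclusion-exclusion identity $f_S = \sum_{T \subseteq S} (-1)^{|S \setminus T|} f_{\subseteq T}$.

```python
from fractions import Fraction
from itertools import combinations, product


def subsets(indices):
    """All subsets of `indices` as sorted tuples."""
    items = sorted(indices)
    return [c for r in range(len(items) + 1) for c in combinations(items, r)]


def conditional_expectation(f_table, points, S):
    """f_{subseteq S}: average of f over all coordinates outside S (uniform measure)."""
    groups = {}
    for x in points:
        groups.setdefault(tuple(x[i] for i in S), []).append(f_table[x])
    means = {key: sum(vals) / len(vals) for key, vals in groups.items()}
    return {x: means[tuple(x[i] for i in S)] for x in points}


def efron_stein(f_table, omega, n):
    """Return {S: f_S}, each f_S given as a table over omega^n."""
    points = list(product(omega, repeat=n))
    below = {S: conditional_expectation(f_table, points, S) for S in subsets(range(n))}
    return {
        S: {
            x: sum((-1) ** (len(S) - len(T)) * below[T][x] for T in subsets(S))
            for x in points
        }
        for S in below
    }


# Example: a discretized ordering predicate 1{x_0 < x_1 < x_2} on {0, 1, 2, 3}^3.
omega, n = [0, 1, 2, 3], 3
points = list(product(omega, repeat=n))
f_table = {x: Fraction(int(x[0] < x[1] < x[2])) for x in points}
parts = efron_stein(f_table, omega, n)

# The parts sum back to f, and distinct parts are uncorrelated.
reconstructs = all(sum(p[x] for p in parts.values()) == f_table[x] for x in points)
uncorrelated = all(
    sum(parts[S][x] * parts[T][x] for x in points) == 0
    for S in parts for T in parts if S != T
)
```

The discretization is only a stand-in for the continuous space $[-1, 1]^n$ used in the paper; it is chosen so that the identities can be checked with exact arithmetic.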


4 Efron–Stein Decomposition of Ordering CSP Objective

In this section, we study the Efron–Stein decomposition of the function $\Phi(x_1, \dots, x_n)$. To this end, we
represent $\Phi(x_1, \dots, x_n)$ as a sum of “basic ordering predicates” and then analyze the Efron–Stein
decomposition of a basic ordering predicate.


4.1 Basic Ordering Predicate 

Let $\tau = (\tau_1, \dots, \tau_r)$ be a tuple of distinct indices from 1 to n. Define the basic ordering predicate
for $\tau$:

\[ \phi_\tau(x_1, \dots, x_n) = \begin{cases} 1, & \text{if } x_{\tau_1} < x_{\tau_2} < \dots < x_{\tau_r}, \\ 0, & \text{otherwise.} \end{cases} \]

Note that the indicator of each constraint $\pi$ is a sum of ordering predicates:

\[ \sum_{\tau :\ \text{ordering } x_{\tau_1} < x_{\tau_2} < \dots \text{ is in } \pi} \phi_\tau(x_1, \dots, x_n), \]

where the sum is over permutations of variables that the constraint $\pi$ depends on. Since $\Phi$ is the sum of
indicators of all predicates $\pi$ in $\Pi$, $\Phi$ is also a sum of basic ordering predicates $\phi_\tau$:

\[ \Phi(x_1, \dots, x_n) = \sum_{\tau \in T} \phi_\tau(x_1, \dots, x_n), \]

for some multiset T.


4.2 Efron–Stein Decomposition of Ordering Predicates

Let $\Omega = [-1, 1]$ and $\mu$ be the uniform measure on $\Omega$. We study the Efron–Stein decomposition of a
basic ordering predicate $\phi_\tau$.

Theorem 9. Let $\tau$ be a tuple of distinct indices of size $d \le k$. Denote $g = \phi_\tau$. Consider the
Efron–Stein decomposition of g, $g = \sum_S g_S$, over $[-1, 1]^n$ with uniform measure. There exists a set of
polynomials $q_{S,\tau'}$ with integer coefficients of degree at most d such that

\[ g_S(x_1, \dots, x_n) = \sum_{\tau'} \phi_{\tau'}(x_1, \dots, x_n)\, \frac{q_{S,\tau'}(x_1, \dots, x_n)}{2^d d!}, \]

where the summation is over all permutations $\tau'$ of S. The polynomial $q_{S,\tau'}$ depends only on variables in
$\{x_i : i \in S\}$. It is equal to 0 if S is not a subset of $\{\tau_1, \dots, \tau_d\}$.

Proof. We may assume without loss of generality that $\tau = (1, 2, \dots, d)$. Since g depends only on variables
$x_1, \dots, x_d$, $g_S \ne 0$ only if $S \subseteq \{1, \dots, d\}$. We may therefore assume that n = d for
notational convenience.

Denote the elements of S by $s_1 < s_2 < \dots < s_t$. Define auxiliary variables $X_0 = -1$ and $X_{d+1} = 1$,
and let $s_0 = 0$ and $s_{t+1} = d + 1$. Let $O_{ab}$ be the indicator of the event that $X_i < X_j$ for every
$a \le i < j \le b$. Then $g = O_{0,d+1}$. Note that $g = \prod_{i=0}^{t} O_{s_i, s_{i+1}}$ and the events
$\{O_{s_i, s_{i+1}} = 1\}$ (for $i \in \{0, \dots, t\}$) are independent given variables
$X_{s_1}, \dots, X_{s_t}$. Therefore,

\[ g_{\subseteq S} = \mathbb{E}[g \mid X_{s_1}, \dots, X_{s_t}] = \prod_{i=0}^{t} \mathbb{E}[O_{s_i, s_{i+1}} \mid X_{s_1}, \dots, X_{s_t}]. \tag{5} \]


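For each i, we have

\[ \mathbb{E}[O_{s_i, s_{i+1}} \mid X_{s_1}, \dots, X_{s_t}] = \mathbb{E}[O_{s_i, s_{i+1}} \mid X_{s_i}, X_{s_{i+1}}] = \Pr(O_{s_i, s_{i+1}} = 1 \mid X_{s_i}, X_{s_{i+1}}). \]

If $X_{s_i} > X_{s_{i+1}}$, then $\Pr(O_{s_i, s_{i+1}} = 1 \mid X_{s_i}, X_{s_{i+1}}) = 0$. Otherwise,

\[ \Pr(O_{s_i, s_{i+1}} = 1 \mid X_{s_i}, X_{s_{i+1}}) = \frac{\bigl((X_{s_{i+1}} - X_{s_i})/2\bigr)^{s_{i+1} - s_i - 1}}{(s_{i+1} - s_i - 1)!}. \]

We computed the probability above as follows: Given $X_{s_i} < X_{s_{i+1}}$, the probability that
$X_j \in [X_{s_i}, X_{s_{i+1}}]$ for all $j \in \{s_i + 1, \dots, s_{i+1} - 1\}$ equals
$((X_{s_{i+1}} - X_{s_i})/2)^{s_{i+1} - s_i - 1}$. Then, given that $X_{s_i} < X_{s_{i+1}}$ and
$X_j \in [X_{s_i}, X_{s_{i+1}}]$ for all $j \in \{s_i + 1, \dots, s_{i+1} - 1\}$, the probability that
$X_{s_i + 1} < \dots < X_{s_{i+1} - 1}$ equals $1/(s_{i+1} - s_i - 1)!$, as all orderings of
$X_{s_i + 1}, \dots, X_{s_{i+1} - 1}$ are equally likely. We get

\[ \mathbb{E}[O_{s_i, s_{i+1}} \mid X_{s_i}, X_{s_{i+1}}] = \mathbb{1}\{X_{s_i} < X_{s_{i+1}}\}\, \frac{(X_{s_{i+1}} - X_{s_i})^{s_{i+1} - s_i - 1}}{2^{s_{i+1} - s_i - 1}\, (s_{i+1} - s_i - 1)!}. \]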

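Plugging this expression in (5), we obtain the following formula:

\[ g_{\subseteq S} = \prod_{i=0}^{t} \mathbb{1}\{X_{s_i} < X_{s_{i+1}}\}\, \frac{(X_{s_{i+1}} - X_{s_i})^{s_{i+1} - s_i - 1}}{2^{s_{i+1} - s_i - 1}\, (s_{i+1} - s_i - 1)!}
 = \mathbb{1}\{X_{s_1} < \dots < X_{s_t}\}\, \frac{\prod_{i=0}^{t} (X_{s_{i+1}} - X_{s_i})^{s_{i+1} - s_i - 1}}{2^{d - |S|} \prod_{i=0}^{t} (s_{i+1} - s_i - 1)!}. \]

Observe that $\prod_{i=0}^{t} (s_{i+1} - s_i - 1)!$ divides
$\bigl(\sum_{i=0}^{t} (s_{i+1} - s_i - 1)\bigr)! = (d - |S|)!$. Thus

\[ \frac{(d - |S|)!}{\prod_{i=0}^{t} (s_{i+1} - s_i - 1)!} \prod_{i=0}^{t} (X_{s_{i+1}} - X_{s_i})^{s_{i+1} - s_i - 1} \]

is a polynomial with integer coefficients of degree at most $d - |S|$. Denote this polynomial by $p_S$.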

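Then,

\[ g_{\subseteq S} = \mathbb{1}\{X_{s_1} < X_{s_2} < \dots < X_{s_t}\}\, \frac{p_S(X_1, \dots, X_d)}{2^{d - |S|} (d - |S|)!}
 = \sum_{\tau'} \phi_{\tau'}(X_1, \dots, X_d)\, \frac{p_S(X_1, \dots, X_d)}{2^{d - |S|} (d - |S|)!}, \]

where the sum is over all permutations $\tau'$ of $\{1, \dots, d\}$ in which $s_1, \dots, s_t$ appear in this
order. Using the identity $f_S = \sum_{T \subseteq S} (-1)^{|S \setminus T|} f_{\subseteq T}$, we get a
representation of $g_S$ as

\[ g_S = \sum_{\tau'} \phi_{\tau'}(X_1, \dots, X_d)\, \frac{q_{S,\tau'}(X_1, \dots, X_d)}{2^d d!}, \]

where $q_{S,\tau'}$ are some polynomials with integer coefficients. □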
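As a sanity check of the product formula for $g_{\subseteq S}$ derived in the proof, consider the small case $d = 3$, $S = \{2\}$ (this case is chosen here for illustration and does not appear in the original argument). Then $t = 1$, $s_0 = 0$, $s_1 = 2$, $s_2 = 4$, $X_0 = -1$, and $X_4 = 1$:

```latex
\begin{align*}
g_{\subseteq \{2\}}
  &= \mathbb{E}\bigl[\mathbb{1}\{X_1 < X_2 < X_3\} \mid X_2\bigr]
   = \frac{X_2 + 1}{2} \cdot \frac{1 - X_2}{2}
   = \frac{1 - X_2^2}{4}, \\
\mathbb{E}\bigl[g_{\subseteq \{2\}}\bigr]
  &= \frac{1}{2} \int_{-1}^{1} \frac{1 - x^2}{4}\, dx = \frac{1}{6} = \frac{1}{3!}, \\
g_{\{2\}}
  &= g_{\subseteq \{2\}} - g_{\subseteq \emptyset}
   = \frac{1 - X_2^2}{4} - \frac{1}{6}
   = \frac{4 - 12 X_2^2}{2^3 \cdot 3!}.
\end{align*}
```

The second line matches the probability $1/3!$ that three independent uniform values appear in increasing order, and the last line has the form $q/(2^d d!)$ with an integer polynomial q of degree $2 \le d$, in agreement with Theorem 9.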

Since $\Phi$ is a sum of some basic ordering predicates (see Section 4.1), we get the following corollary.

Corollary 10. Let $\mathcal{I}$ be an instance of an ordering CSP problem of arity at most k. Let
$\Phi(x_1, \dots, x_n)$ be the value of continuous solution $(x_1, \dots, x_n)$. Then the Efron–Stein decomposition
of $\Phi$ has degree at most k. Moreover, there exist polynomials $q_{S,\tau}$ with integer coefficients of degree at
most k such that

\[ \Phi_S(x_1, \dots, x_n) = \sum_{\tau \in T'} \phi_\tau(x_1, \dots, x_n)\, \frac{q_{S,\tau}(x_1, \dots, x_n)}{2^k k!}, \]

where the summation is over some set $T'$ of tuples of indices in S, and $q_{S,\tau}$ depends only on variables
in $\{x_i : i \in S\}$.
4.3 Variance of Ordering CSP Objective

In this section, we show that the variance $\mathrm{Var}[\Phi] = \Omega_k(v)$ if $\Phi$ (non-trivially) depends on at
least v variables.

Claim 11. There exists a sequence of positive numbers $\alpha_d$ such that for every nonzero polynomial
$f(x_1, \dots, x_d)$ of degree at most d with integer coefficients we have
$\mathbb{E}\bigl[\phi_{(1, \dots, d)}(X_1, \dots, X_d)\, f(X_1, \dots, X_d)^2\bigr] \ge \alpha_d$.

Proof. Consider the set Q of polynomials over $x_1, \dots, x_d$ of degree at most d. Let $Q_1$ be the set of
polynomials in Q whose largest in absolute value coefficient is equal to 1 or −1.

Denote $V(f) = \mathbb{E}\bigl[\phi_{(1, \dots, d)}(X_1, \dots, X_d)\, f(X_1, \dots, X_d)^2\bigr]$. For every
$f \in Q_1$, we have $V(f) > 0$ since f is not identically equal to 0 on $\{x_1 < x_2 < \dots < x_d\}$. Note that
$Q_1$ is a compact set and $V(f)$ is a continuous function on it. Therefore, V attains its minimum on $Q_1$. Let
$\alpha_d = \min_{f \in Q_1} V(f) > 0$.

Now let f be a nonzero polynomial with integer coefficients of degree at most d. Denote the absolute value of its
largest coefficient (in absolute value) by M. M is a positive integer and thus $M \ge 1$. We have $f/M \in Q_1$
and thus

\[ V(f) = M^2 \cdot V(f/M) \ge M^2 \alpha_d \ge \alpha_d. \]

□


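Lemma 12. The following claim holds for some positive parameters $\beta_k$. Let $\mathcal{I}$ be an instance of arity
at most k. Let $\Phi = \sum_S \Phi_S$ be the Efron–Stein decomposition of $\Phi$. Then for every set S either
$\Phi_S = 0$ or

\[ \mathbb{E}[\Phi_S^2] \ge \beta_k. \]

Proof. Let $\beta_k = \alpha_k / (2^k k!)^2 > 0$, where $\alpha_k$ is as in Claim 11. Assume that $\Phi_S \ne 0$. By
Corollary 10,

\[ \Phi_S(x_1, \dots, x_n) = \sum_{\tau \in T'} \phi_\tau(x_1, \dots, x_n)\, \frac{q_{S,\tau}(x_1, \dots, x_n)}{2^k k!}. \]

Note that all functions $\phi_\tau(x_1, \dots, x_n)\, q_{S,\tau}(x_1, \dots, x_n) / (2^k k!)$ have disjoint support,
and, therefore, are pairwise orthogonal. Choose one tuple $\tau \in T'$ such that $q_{S,\tau} \ne 0$. We have

\[ \mathbb{E}[\Phi_S^2] \ge \mathbb{E}\bigl[\phi_\tau^2 q_{S,\tau}^2 / (2^k k!)^2\bigr] = \mathbb{E}\bigl[\phi_\tau q_{S,\tau}^2\bigr] / (2^k k!)^2. \]

By Claim 11, $\mathbb{E}[\phi_\tau q_{S,\tau}^2] \ge \alpha_k$ and hence
$\mathbb{E}[\Phi_S^2] \ge \alpha_k / (2^k k!)^2$.

□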


We say that $\Phi$ depends on the variable $x_i$ if there exist two vectors x and x' that differ only in the i-th
coordinate such that $\Phi(x) \ne \Phi(x')$.

Theorem 13. Let $\mathcal{I}$ be an instance of arity at most k. Suppose that $\Phi$ depends on at least v variables.
Then $\mathrm{Var}[\Phi] \ge v \beta_k / k$.

Proof. Consider the Efron–Stein decomposition of $\Phi$. Let
$V' = \bigcup_{S : \Phi_S \ne 0} \{x_i : i \in S\}$. Note that $\Phi$ depends on all variables in $V'$ and no other
variables. Thus, $|V'| \ge v$. There are at least $v/k$ non-empty sets S with $\Phi_S \ne 0$ since each such set S
contributes at most k elements to $V'$. For $S \ne \emptyset$, we have $\mathbb{E}[\Phi_S] = 0$ and hence
$\mathrm{Var}[\Phi_S] = \mathbb{E}[\Phi_S^2]$. By Lemma 12,
$\mathrm{Var}[\Phi_S] = \mathbb{E}[\Phi_S^2] \ge \beta_k$ if $\Phi_S \ne 0$ and $S \ne \emptyset$. We have

\[ \mathrm{Var}[\Phi] = \sum_S \mathrm{Var}[\Phi_S] \ge |\{S \ne \emptyset : \Phi_S \ne 0\}| \cdot \beta_k \ge (v/k)\, \beta_k, \]

as required.

□
5 Bonami Lemma for ordering CSPs

We are going to apply Theorem 4 (the Bonami Lemma for the Efron–Stein decomposition) to the function
$f = \Phi - \mathbb{E}[\Phi]$, where $\Phi$ is the objective function of the ordering CSP problem. In order to do
that, we show now that f satisfies the condition of the theorem (Condition (1)) with some constant C that depends
only on the arity of the CSP.

Lemma 14. There exists a sequence of constants $C_k$ such that the following holds. Let $\mathcal{I}$ be an instance
of an ordering CSP of arity at most k. Let $f = \Phi - \mathbb{E}[\Phi]$ and $S \subseteq \{1, \dots, n\}$ of
cardinality at most k. Then

\[ \mathbb{E}[f_S^4] \le C_k\, \mathbb{E}[f_S^2]^2. \]

Proof. We assume that S is non-empty as otherwise both the left and right hand sides of the inequality are
equal to 0 (since $f_\emptyset = \mathbb{E}[f] = 0$). Therefore, $f_S = \Phi_S$.

Since $|S| \le k$, we may assume without loss of generality that $S \subseteq \{1, \dots, k\}$. Let Q be the set of
all functions of $x_1, \dots, x_k$ of the form

\[ \sum_{\tau} \phi_\tau(x_1, \dots, x_k)\, q_{S,\tau}(x_1, \dots, x_k), \]

where $q_{S,\tau}$ are some polynomials of degree at most k with real coefficients. By Corollary 10, $f_S \in Q$. Let
$Q_1 = \{h \in Q : \|h\|_2 = 1\}$. Note that $Q_1$ is a compact set (since Q is a finite dimensional space and
$\|\cdot\|_2$ is a norm on it). Therefore, the continuous function $W(g) = \mathbb{E}[g^4]$ is bounded when
$g \in Q_1$. Denote its maximum by $C_k$ (note that $C_k$ depends only on k and not on $\mathcal{I}$).

Letting $g = f_S / \|f_S\|_2 \in Q_1$, we have

\[ \mathbb{E}[f_S^4] = \|f_S\|_2^4 \cdot W(g) \le C_k \cdot \|f_S\|_2^4, \]

as required. □

6 Proof of Main Theorems 

In this section, we prove Theorems 8 and 3. We will need the following theorem.

Theorem 15 (Corollary 1 from Alon, Gutin, Kim, Szeider, and Yeo [1]). Let X be a real random variable.
Suppose that $\mathbb{E}[X] = 0$, $\mathbb{E}[X^2] = \sigma^2$, and $\mathbb{E}[X^4] \le b\sigma^4$ for some
$b > 0$. Then $\Pr(X > \sigma/(2\sqrt{b})) > 0$.
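The statement of Theorem 15 is easy to check numerically on a finite distribution. The following sketch is illustrative only: the function name and the example distribution are chosen here and are not taken from [1]. It computes the smallest admissible b, the threshold $\sigma/(2\sqrt{b})$, and the probability of exceeding that threshold.

```python
from fractions import Fraction
from math import sqrt


def fourth_moment_check(dist):
    """dist: list of (value, probability) pairs of a mean-zero random variable.

    Returns (b, threshold, prob) where b = E[X^4] / sigma^4 is the smallest admissible
    constant, threshold = sigma / (2 * sqrt(b)), and prob = Pr(X > threshold).
    """
    if sum(p for _, p in dist) != 1 or sum(p * v for v, p in dist) != 0:
        raise ValueError("need a probability distribution with mean zero")
    second = sum(p * v ** 2 for v, p in dist)
    fourth = sum(p * v ** 4 for v, p in dist)
    b = fourth / second ** 2
    threshold = sqrt(second) / (2 * sqrt(b))
    prob = sum(p for v, p in dist if v > threshold)
    return b, threshold, prob


# X = 2 with probability 1/3 and X = -1 with probability 2/3.
b, threshold, prob = fourth_moment_check([(2, Fraction(1, 3)), (-1, Fraction(2, 3))])
```

In this example the variance is 2 and the fourth moment is 6, so b = 3/2, and the value 2 lies above the resulting threshold with positive probability, as the theorem guarantees.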

Proof of Theorem 8. Let $V'$ be the set of variables that $\Phi$ depends on (see Section 4.3 for definitions). By
Theorem 13, $\mathrm{Var}[\Phi] \ge |V'| \cdot \beta_k / k$ for some absolute constant $\beta_k > 0$. By Lemma 14,
the function $\Phi - \mathbb{E}\Phi$ satisfies condition (1) of the Bonami Lemma for the Efron–Stein Decomposition
(Theorem 4) with some absolute constant $C_k$. Hence,
$\|\Phi - \mathbb{E}\Phi\|_4^4 \le 81^k C_k \|\Phi - \mathbb{E}\Phi\|_2^4$. Applying Theorem 15 to the random
variable $\Phi - \mathbb{E}\Phi$ with $\sigma = \mathrm{Var}[\Phi]^{1/2} \ge (|V'| \beta_k / k)^{1/2}$ and
$b = 81^k C_k$, we get that $\Pr\bigl(\Phi > \mathbb{E}\Phi + (|V'| / \kappa_k)^{1/2}\bigr) > 0$,
where $\kappa_k = 4 k C_k 81^k / \beta_k$. Consequently,

\[ OPT = \max_x \Phi(x) > \mathbb{E}\Phi + (|V'| / \kappa_k)^{1/2} = AVG + (|V'| / \kappa_k)^{1/2}. \]

We are now ready to state the algorithm. The algorithm computes the Efron–Stein decomposition in
time $O_k(m + n)$. Then, using the formula $V' = \bigcup_{S : \Phi_S \ne 0} \{x_i : i \in S\}$ (see Theorem 13), it
finds the set $V'$ also in time $O_k(m + n)$. It considers two cases.

1. If $|V'| \ge \kappa_k t^2$, then the algorithm returns OPT > AVG + t.

2. Otherwise, if $|V'| < \kappa_k t^2$, the algorithm outputs the restriction of $\mathcal{I}$ to the variables in
$V'$. This is a kernel for $\mathcal{I}$, since $\Phi$ depends only on the variables in $V'$.

□ 

To prove Theorem 3, we need to show how to find an assignment satisfying AVG + t constraints if
$|V'| \ge \kappa_k t^2$. This can be easily done using 4k-rankwise independent permutations. A random permutation
$\alpha$ is m-rankwise independent if for every subset $M \subseteq \{1, \dots, n\}$ of size m, the order of elements
in M induced by $\alpha$ is uniformly distributed (the definition is due to Itoh, Takei, and Tarui [21]). Note that
any m-wise independent permutation $\alpha$ is also an m-rankwise independent permutation. Using the result
of Alon and Lovett [2], we can obtain a 4k-wise independent permutation $\alpha$ supported on a set of size
$n^{O(k)}$. In Lemma ?? (in Appendix A), we show that for some permutation $\alpha^*$ in the support of $\alpha$, we
have $\mathrm{val}_{\mathcal{I}}(\alpha^*) > AVG + t$. Hence, to find an assignment satisfying AVG + t constraints, we
need to search for the best permutation in the support of $\alpha$, which can be done in time


7 Bonami Lemma


In this section, we prove the Bonami Lemma for the Efron–Stein decomposition (Theorem 4) stated in the
introduction. Our starting point will be the standard Bonami Lemma for Bernoulli ±1 random variables.

Lemma 16 (see lffl ll21l ). Let $f : \{-1, 1\}^n \to \mathbb{R}$ be a polynomial of degree at most $d$. Let $X_1, \dots, X_n$ be
independent unbiased ±1-Bernoulli variables. Then

$$\mathbb{E}[f(X_1, \dots, X_n)^4] \le 9^d\, \mathbb{E}[f(X_1, \dots, X_n)^2]^2.$$


We will consider the following probability distribution in this proof. Let $Z$ be a random variable equal
to 3 with probability 1/4 and to −1 with probability 3/4. Denote by $\mathcal{Z}$ the probability distribution of $Z$. We
first prove a variant of the Bonami Lemma for random variables distributed according to $\mathcal{Z}$.


Lemma 17. Let $f : \{-1, 3\}^n \to \mathbb{R}$ be a polynomial of degree at most $d$. Let $Z_1, \dots, Z_n$ be independent
random variables distributed according to $\mathcal{Z}$. Then

$$\mathbb{E}[f(Z_1, \dots, Z_n)^4] \le 81^d\, \mathbb{E}[f(Z_1, \dots, Z_n)^2]^2.$$

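Proof. Consider $2n$ independent Bernoulli random variables $X'_1, \dots, X'_n, X''_1, \dots, X''_n$ (uniformly distributed in $\{-1, 1\}$).
Note that the random variables $Z'_i = X'_i + X''_i + X'_i X''_i$ are distributed in the same way as the random variables $Z_i$.
Therefore,

$$\mathbb{E}[f(Z_1, \dots, Z_n)^4] = \mathbb{E}[f(Z'_1, \dots, Z'_n)^4] \quad\text{and}\quad \mathbb{E}[f(Z_1, \dots, Z_n)^2]^2 = \mathbb{E}[f(Z'_1, \dots, Z'_n)^2]^2.$$

Now $f(Z'_1, \dots, Z'_n)$ is a polynomial of the ±1 variables $X'_1, \dots, X'_n, X''_1, \dots, X''_n$ of degree at most $2d$. Applying
Lemma 16 to $f(Z'_1, \dots, Z'_n)$, we get

$$\mathbb{E}[f(Z'_1, \dots, Z'_n)^4] \le 9^{2d}\, \mathbb{E}[f(Z'_1, \dots, Z'_n)^2]^2,$$

and, therefore,

$$\mathbb{E}[f(Z_1, \dots, Z_n)^4] \le 81^d\, \mathbb{E}[f(Z_1, \dots, Z_n)^2]^2,$$

as required. □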
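The key step of the proof is that $X' + X'' + X'X''$ has the same law as $Z$ when $X'$ and $X''$ are independent uniform signs. The following short sketch checks this by exhaustive enumeration; the function name `law_of_z_prime` is our own.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def law_of_z_prime():
    """Exact law of X' + X'' + X'X'' for independent uniform signs X', X''."""
    law = Counter()
    for x1, x2 in product((-1, 1), repeat=2):
        # Each of the four sign patterns has probability 1/4.
        law[x1 + x2 + x1 * x2] += Fraction(1, 4)
    return dict(law)
```

Only the pattern $X' = X'' = 1$ gives the value 3; the other three patterns give −1, which is exactly the distribution $\mathcal{Z}$.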
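Now let $f \in L_2(\Omega^n, \mu^n)$ and let $f = \sum_S f_S$ be its Efron–Stein decomposition. Define the polynomial $M_f : \mathbb{R}^n \to \mathbb{R}$ by

$$M_f(z_1, \dots, z_n) = \sum_{S \subseteq \{1,\dots,n\}} M_{f,S} \prod_{i \in S} z_i, \quad\text{where } M_{f,S} = (1/3)^{|S|/2}\, \mathbb{E}[f_S^2]^{1/2} \text{ for every } S \subseteq \{1, \dots, n\}.$$

We now get bounds for moments of $f(X_1, \dots, X_n)$ in terms of moments of $M_f(Z_1, \dots, Z_n)$.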

Claim 18. $\mathbb{E}[M_f(Z_1, \dots, Z_n)^2] = \mathbb{E}[f(X_1, \dots, X_n)^2]$.

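Proof. Note that $\mathbb{E}[Z_i] = 0$ and thus $\mathbb{E}[M_f(Z_1, \dots, Z_n)] = M_{f,\emptyset} = \|f_\emptyset\|_2 = |\mathbb{E}[f]|$.

Also, since the variables $Z_i$ are independent and have mean 0, all cross terms vanish, and

$$\mathbb{E}[M_f(Z_1, \dots, Z_n)^2] = \sum_{S \subseteq \{1,\dots,n\}} M_{f,S}^2 \cdot \mathbb{E}\Big[\prod_{i \in S} Z_i^2\Big].$$

Since $\mathbb{E}[Z_i^2] = 3$, we have $\mathbb{E}[\prod_{i \in S} Z_i^2] = 3^{|S|}$, and therefore

$$\mathbb{E}[M_f(Z_1, \dots, Z_n)^2] = \sum_{S \subseteq \{1,\dots,n\}} \big((1/3)^{|S|/2}\, \mathbb{E}[f_S^2]^{1/2}\big)^2 \cdot 3^{|S|} = \sum_{S \subseteq \{1,\dots,n\}} \mathbb{E}[f_S^2] = \mathbb{E}[f^2].$$

□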


Claim 19. Let $f$ and $C$ be as in the condition of Theorem 4. Then

$$\mathbb{E}[f(X_1, \dots, X_n)^4] \le C\, \mathbb{E}[M_f(Z_1, \dots, Z_n)^4].$$


Proof. Write

$$\mathbb{E}[f^4] = \sum_{S_1, S_2, S_3, S_4} \mathbb{E}[f_{S_1} f_{S_2} f_{S_3} f_{S_4}],$$

$$\mathbb{E}[M_f^4] = \sum_{S_1, S_2, S_3, S_4} M_{f,S_1} M_{f,S_2} M_{f,S_3} M_{f,S_4}\, \mathbb{E}\Big[\prod_{i_1 \in S_1} Z_{i_1} \prod_{i_2 \in S_2} Z_{i_2} \prod_{i_3 \in S_3} Z_{i_3} \prod_{i_4 \in S_4} Z_{i_4}\Big].$$

To prove the claim, we show that for every four sets $S_1$, $S_2$, $S_3$, $S_4$, the following inequality holds:

$$\mathbb{E}[f_{S_1} f_{S_2} f_{S_3} f_{S_4}] \le C\, M_{f,S_1} M_{f,S_2} M_{f,S_3} M_{f,S_4}\, \mathbb{E}\Big[\prod_{i_1 \in S_1} Z_{i_1} \prod_{i_2 \in S_2} Z_{i_2} \prod_{i_3 \in S_3} Z_{i_3} \prod_{i_4 \in S_4} Z_{i_4}\Big]. \qquad (6)$$

Note first that if some index $j$ appears in exactly one of the sets $S_1$, $S_2$, $S_3$, and $S_4$, then the expressions on
the left and on the right are equal to 0 (by Property 3 of the Efron–Stein decomposition in Section 3.3),
and we are done. So we assume that every index $j$ in $S_1 \cup S_2 \cup S_3 \cup S_4$ appears in at least 2 of the sets $S_r$.
Denote the number of times $j$ appears in the sets $S_r$ by $m(j)$.
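Applying the Cauchy–Schwarz inequality three times, we get

$$\mathbb{E}[f_{S_1} f_{S_2} f_{S_3} f_{S_4}] \le \mathbb{E}[(f_{S_1} f_{S_2})^2]^{1/2}\, \mathbb{E}[(f_{S_3} f_{S_4})^2]^{1/2} = \big(\mathbb{E}[f_{S_1}^2 f_{S_2}^2]\, \mathbb{E}[f_{S_3}^2 f_{S_4}^2]\big)^{1/2}$$

$$\le \big(\mathbb{E}[f_{S_1}^4]\, \mathbb{E}[f_{S_2}^4]\, \mathbb{E}[f_{S_3}^4]\, \mathbb{E}[f_{S_4}^4]\big)^{1/4} = \|f_{S_1}\|_4 \cdot \|f_{S_2}\|_4 \cdot \|f_{S_3}\|_4 \cdot \|f_{S_4}\|_4.$$

By the condition of Theorem 4 and the definition of the coefficients $M_{f,S}$,

$$\mathbb{E}[f_{S_1} f_{S_2} f_{S_3} f_{S_4}] \le \|f_{S_1}\|_4 \cdot \|f_{S_2}\|_4 \cdot \|f_{S_3}\|_4 \cdot \|f_{S_4}\|_4 \le C \big(\mathbb{E}[f_{S_1}^2]\, \mathbb{E}[f_{S_2}^2]\, \mathbb{E}[f_{S_3}^2]\, \mathbb{E}[f_{S_4}^2]\big)^{1/2}$$

$$= C \cdot 3^{(|S_1| + |S_2| + |S_3| + |S_4|)/2}\, M_{f,S_1} M_{f,S_2} M_{f,S_3} M_{f,S_4}. \qquad (10)$$

We also have

$$\mathbb{E}\Big[\prod_{i_1 \in S_1} Z_{i_1} \prod_{i_2 \in S_2} Z_{i_2} \prod_{i_3 \in S_3} Z_{i_3} \prod_{i_4 \in S_4} Z_{i_4}\Big] = \mathbb{E}\Big[\prod_{r=1}^{4} \prod_{i \in S_r} Z_i\Big] = \prod_{i \in \bigcup_r S_r} \mathbb{E}\big[Z_i^{m(i)}\big].$$

We compute $\mathbb{E}[Z^{m(i)}]$ for $m(i) \in \{2, 3, 4\}$. We get $\mathbb{E}Z^2 = 3$, $\mathbb{E}Z^3 = 6$ and $\mathbb{E}Z^4 = 21$. Thus, $(\mathbb{E}Z^2)^{1/2} = \sqrt{3}$,
$(\mathbb{E}Z^3)^{1/3} = 6^{1/3} > \sqrt{3}$ and $(\mathbb{E}Z^4)^{1/4} = 21^{1/4} > \sqrt{3}$, and, consequently,

$$\mathbb{E}\Big[\prod_{r=1}^{4} \prod_{i \in S_r} Z_i\Big] \ge \prod_{i \in \bigcup_r S_r} 3^{m(i)/2} = \prod_{r=1}^{4} \prod_{i \in S_r} 3^{1/2} = 3^{(|S_1| + |S_2| + |S_3| + |S_4|)/2}. \qquad (11)$$

Since all coefficients $M_{f,S}$ are non-negative, we get from (10) and (11) that inequality (6) holds. □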
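The moment values used in the proof of Claim 19 can be checked exactly. The sketch below recomputes $\mathbb{E}Z^m$ for the distribution $\mathcal{Z}$ and tests the inequality $(\mathbb{E}Z^m)^{1/m} \ge \sqrt{3}$ in the equivalent integer form $(\mathbb{E}Z^m)^2 \ge 3^m$; the names `Z_LAW`, `z_moment` and `normalized_moment_ok` are ours.

```python
from fractions import Fraction

# Z equals 3 with probability 1/4 and -1 with probability 3/4.
Z_LAW = {3: Fraction(1, 4), -1: Fraction(3, 4)}


def z_moment(m):
    """Exact m-th moment E[Z^m]."""
    return sum(prob * value**m for value, prob in Z_LAW.items())


def normalized_moment_ok(m):
    """Check (E Z^m)^(1/m) >= sqrt(3), rewritten as (E Z^m)^2 >= 3^m."""
    return z_moment(m) ** 2 >= 3**m
```

Working with squares avoids floating-point roots, so the comparison is exact for $m \in \{2, 3, 4\}$.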
Proof of Theorem 4. By Lemma 17, we have

$$\|M_f(Z_1, \dots, Z_n)\|_4^4 \le 81^d\, \|M_f(Z_1, \dots, Z_n)\|_2^4. \qquad (12)$$

From Claims 18 and 19 we get

$$\mathbb{E}[f(X_1, \dots, X_n)^4] \le C\, \mathbb{E}[M_f(Z_1, \dots, Z_n)^4] \le 81^d C\, \mathbb{E}[M_f(Z_1, \dots, Z_n)^2]^2 = 81^d C\, \mathbb{E}[f(X_1, \dots, X_n)^2]^2.$$

□


8 General Framework 

8.1 Filtered A-Lattice of Functions 

In this section, we generalize the result of the paper to a more general class of constraint satisfaction
problems having a lattice structure. In Section 9, we show that LP CSPs and valued CSPs with “piecewise
polynomial predicates” (see Section 9 for the definition) have a lattice structure.
8.2 Discussion


We note that in our proofs we used only a few properties of ordering CSPs. Specifically, in Theorem 9, we
showed that all functions in the Efron–Stein decomposition of the basic ordering predicate lie in a fixed set
of functions. Since this set is closed under addition (the sum of any two functions in the set is in the set),
we got that all functions in the Efron–Stein decomposition of the ordering CSP objective $\Phi$ are also in this
set. Then in the proof of Lemma 12, we showed that every non-zero function in this set has variance at least
$\beta_k$ (where $\beta_k$ depends only on $k$), and this was sufficient to get the result of the paper. To summarize, we
only used the following properties of this set of functions:

A. all functions in the Efron–Stein decomposition of each predicate are in the set,

B. the set is closed under addition,

C. every non-zero function in the set has variance at least $\beta$ (for some fixed $\beta > 0$).

8.3 Filtered A-Lattice of Functions

We now formalize properties A, B, and C in the definitions of an A-lattice of functions and a filtered A-lattice of
functions. Recall first the definition of a lattice.

Definition 20. Let $V$ be a finite-dimensional space and $\mathcal{L}$ be a subset of $V$. We say that $\mathcal{L}$ is a lattice in $V$
if for some basis $v_1, \dots, v_r$ of $V$, we have $\mathcal{L} = \{\sum_{i=1}^{r} a_i v_i : a_1, \dots, a_r \in \mathbb{Z}\}$. We say that $v_1, \dots, v_r$ is the
basis of the lattice $\mathcal{L}$.

Now we define an A-lattice of functions.

Definition 21. Let $(\Omega, \mu)$ be a probability space. Consider a set $\mathcal{F}$ of bounded (real-valued) functions
$f(x_1, \dots, x_k)$ on $\Omega^k$; $\mathcal{F} \subset L_\infty(\Omega^k)$. We say that $\mathcal{F}$ is an A-lattice of functions of arity (at most) $k$ on $\Omega$ if it
satisfies the following properties.

1. $\mathcal{F}$ is a lattice in a finite-dimensional subspace of $L_\infty(\Omega^k)$.

2. If we permute arguments of a function in $\mathcal{F}$, we get a function in $\mathcal{F}$. Specifically, if $f \in \mathcal{F}$ and $\pi$ is a
permutation of $\{1, \dots, k\}$, then $g(x_1, \dots, x_k) = f(x_{\pi(1)}, \dots, x_{\pi(k)}) \in \mathcal{F}$.

For an A-lattice $\mathcal{F}$, we write that a function $f \in_R \mathcal{F}$ if $f$ is in $\mathcal{F}$ after possibly renaming the arguments
of $f$ (in other words, $f$ is in $\mathcal{F}$ as an abstract function from $\Omega^k$ to $\mathbb{R}$).¹

Clearly, every A-lattice $\mathcal{F}$ of functions satisfies property B. Since $\mathcal{F}$ is discrete, it also satisfies
property C (we will prove that formally in Claim 29). We also want to ensure that it satisfies an analog of
property A. To this end, we consider the averaging operator $A_i$, which takes the expectation of a function
with respect to variable $x_i$, and require that $A_i$ maps every function in the lattice to a function in the lattice.

¹ For example, let $\mathcal{F}$ be an A-lattice of functions of the form $ax_1 + bx_2$ where $a, b \in \mathbb{Z}$. Then $x_3 + 5x_7 \in_R \mathcal{F}$, since after
renaming $x_3$ to $x_1$ and $x_7$ to $x_2$ we get $x_1 + 5x_2$, which is of the form $ax_1 + bx_2$.
Definition 22. For $i \in \{1, \dots, k\}$, let $A_i$ be the averaging operator that maps a function $f$ of arity $k$ to a
function $A_i f$ of arity $k - 1$ defined as follows:

$$A_i f(x_1, \dots, x_{i-1}, x_{i+1}, \dots, x_k) = \int_\Omega f(x_1, \dots, x_k)\, d\mu(x_i)$$

$$= \mathbb{E}[f(X_1, \dots, X_k) \mid X_1 = x_1, \dots, X_{i-1} = x_{i-1}, X_{i+1} = x_{i+1}, \dots, X_k = x_k].$$
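As an illustration of Definition 22, here is a minimal sketch of the averaging operator on a finite probability space, where the integral becomes a weighted sum. The finite space, the function name `average_out` and the example function are assumptions made for the sake of the example; the definition itself applies to arbitrary probability spaces.

```python
from fractions import Fraction


def average_out(f, i, omega, mu):
    """Return A_i f on a finite probability space.

    f takes k positional arguments from omega, i is the 1-based index of the
    variable being averaged, and mu maps each point of omega to its probability.
    The result takes the remaining k - 1 arguments in their original order.
    """
    def averaged(*rest):
        # Re-insert each possible value of x_i at position i and average.
        return sum(mu[w] * f(*rest[:i - 1], w, *rest[i - 1:]) for w in omega)
    return averaged


# Hypothetical example on the uniform two-point space {-1, 1}.
OMEGA = (-1, 1)
MU = {-1: Fraction(1, 2), 1: Fraction(1, 2)}


def f_example(x1, x2):
    return x1 * x2 + x1
```

For `f_example`, averaging over $x_2$ leaves the function $x_1$, while averaging over $x_1$ gives the zero function.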

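Definition 23. We say that a family of sets $\{\mathcal{F}_\alpha\}_\alpha$ (indexed by integer $\alpha \ge 1$) is a filtered A-lattice of
functions of arity (at most) $k$ if it satisfies the following properties.

1. $\mathcal{F}_\alpha$ is an A-lattice of functions of arity $k$ on $\Omega$.

2. $\{\mathcal{F}_\alpha\}$ is a filtration: $\mathcal{F}_\alpha \subseteq \mathcal{F}_{\alpha'}$ for $\alpha \le \alpha'$.

3. For every $\alpha$ there exists $\alpha'$, which we denote by $\alpha' = a(\alpha)$, such that the operator $A_i$ maps $\mathcal{F}_\alpha$ to $\mathcal{F}_{\alpha'}$
(for every $i \in \{1, \dots, k\}$).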

We remark that is a filtered A-lattice. We are going to prove that our result for ordering CSPs 

holds, in fact, for any constraint satisfaction problem with predicates from a filtered A-lattice. 


8.4 General A-CSP($\mathcal{F}_\alpha$, $\alpha$)

Definition 24. Consider a probability space $(\Omega, \mu)$. Let $\{\mathcal{F}_\alpha\}$ be a filtered A-lattice of functions and $\alpha_0$ be an
integer. An instance $\mathcal{I}$ of General A-CSP($\mathcal{F}_\alpha$, $\alpha_0$) consists of a set of variables $x_1, \dots, x_n$, taking values in
$\Omega$, and a set of real-valued constraints of the form $f(x_{i_1}, \dots, x_{i_k})$ where $f \in_R \mathcal{F}_{\alpha_0}$. The objective function
$\Phi(x_1, \dots, x_n)$ is the sum of all the constraints. General A-CSP($\mathcal{F}_\alpha$, $\alpha_0$) asks to find an assignment to
variables $x_1, \dots, x_n$ that maximizes $\Phi(x_1, \dots, x_n)$.


We denote the optimal value of an instance $\mathcal{I}$ by $\mathrm{OPT} = \operatorname{ess\,sup}_{x_1, \dots, x_n \in \Omega} \Phi(x_1, \dots, x_n)$ and the
average by $\mathrm{AVG} = \mathbb{E}[\Phi(X_1, \dots, X_n)]$, where $X_1, \dots, X_n$ are independent random elements of $\Omega$ distributed
according to the probability measure $\mu$.


Remark 25. Note that we follow the standard convention that two functions $f, g \in L_\infty(\Omega^k)$ are equal if
$f(x_1, \dots, x_k) \ne g(x_1, \dots, x_k)$ only on a set of measure 0. That is, we identify functions that are equal almost
everywhere. Accordingly, we define OPT as the essential supremum of $\Phi$: OPT is equal to the maximum
value of $M$ such that

$$\Pr(\Phi \ge M - \varepsilon) > 0$$

for every $\varepsilon > 0$.


We prove a counterpart of Theorem 8 for the General A-CSP problem.


Theorem 26. There is an algorithm that given an instance of General A-CSP($\mathcal{F}_\alpha$, $k$, $\alpha_0$) and a parameter
$t$, either finds a kernel on at most $\kappa t^2$ variables (where $\kappa$ depends only on the filtered A-lattice $\{\mathcal{F}_\alpha\}$ and
numbers $k$, $\alpha_0$) or certifies that OPT ≥ AVG + t. The algorithm runs in time $O(m + n)$, linear in the
number of constraints $m$ and variables $n$ (the coefficient in the $O$-notation depends on the filtered A-lattice
$\{\mathcal{F}_\alpha\}$ and numbers $k$, $\alpha_0$).

We assume that computing the sum of two functions in $\mathcal{F}_\alpha$ requires constant time and that computing
$A_i f$ requires constant time (the time may depend on $\alpha$).
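We first prove analogues of Theorem 9 and Corollary 10 for General A-CSP.

Lemma 27 (cf. Theorem 9). Consider a probability space $(\Omega, \mu)$ and a filtered A-lattice $\{\mathcal{F}_\alpha\}$. For every
$\alpha$, there exists $\alpha'$ so that the following holds. For every $f \in \mathcal{F}_\alpha$ and every $S \subseteq \{1, \dots, k\}$, $f_S \in \mathcal{F}_{\alpha'}$.

Proof. Let $\alpha_0 = \alpha$, $\alpha_1 = a(\alpha_0)$ (where $a$ is as in the definition of a filtered A-lattice), $\alpha_2 = a(\alpha_1)$, and so
on; $\alpha_j = a(\alpha_{j-1})$. Let $\alpha' = \max_{j \in \{0, \dots, k\}} \alpha_j$. Consider a function $f \in \mathcal{F}_\alpha$ and a set $T \subseteq \{1, \dots, k\}$. By (3),

$$f_{\subseteq T} = \mathbb{E}[f(X_1, \dots, X_k) \mid \text{all } X_i \text{ with } i \in T].$$

Denote the elements of $\{1, \dots, k\} \setminus T$ by $i_1 < \dots < i_t$ (where $t = k - |T|$). Note that

$$f_{\subseteq T} = A_{i_1} \cdots A_{i_t} f \in \mathcal{F}_{\alpha_t} \subseteq \mathcal{F}_{\alpha'}$$

by the definition of a filtered A-lattice. Now by (4),

$$f_S = \sum_{T \subseteq S} (-1)^{|S \setminus T|} f_{\subseteq T}.$$

Since $\mathcal{F}_{\alpha'}$ is a lattice and $f_S$ is a linear combination, with integer coefficients, of functions $f_{\subseteq T}$ (all of which
are in $\mathcal{F}_{\alpha'}$), $f_S$ is in $\mathcal{F}_{\alpha'}$. □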
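The inclusion-exclusion formula used in the proof of Lemma 27 can be tried out on a small finite product space. The sketch below computes $f_S$ by brute force from conditional means; the finite space, the 0-based indexing, and all names (`subsets`, `conditional_mean`, `efron_stein_component`, the example predicate) are assumptions of this illustration.

```python
from fractions import Fraction
from itertools import combinations, product


def subsets(indices):
    """Yield all subsets of `indices` as tuples."""
    indices = tuple(indices)
    for r in range(len(indices) + 1):
        yield from combinations(indices, r)


def conditional_mean(f, k, omega, mu, T, x):
    """E[f(X_0..X_{k-1}) | X_i = x[i] for i in T] on a finite space (brute force)."""
    free = [i for i in range(k) if i not in T]
    total = Fraction(0)
    for values in product(omega, repeat=len(free)):
        point, weight = list(x), Fraction(1)
        for i, w in zip(free, values):
            point[i] = w
            weight *= mu[w]
        total += weight * f(*point)
    return total


def efron_stein_component(f, k, omega, mu, S, x):
    """f_S(x): signed sum of conditional means over all subsets T of S."""
    return sum((-1) ** (len(S) - len(T)) * conditional_mean(f, k, omega, mu, T, x)
               for T in subsets(S))


# Hypothetical example: the ordering predicate x_0 < x_1 on a three-point space.
OMEGA = (0, 1, 2)
MU = {w: Fraction(1, 3) for w in OMEGA}


def less_than(a, b):
    return int(a < b)
```

Summing the components over all $S$ recovers the original function, and each component with non-empty $S$ has mean zero, which are the properties used throughout the paper.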

Since the set of functions $\mathcal{F}_{\alpha'}$ is closed under addition, we get that for every General A-CSP($\mathcal{F}_\alpha$, $\alpha_0$)
instance $\mathcal{I}$ with objective function $\Phi$, all functions $\Phi_S$ are also in $\mathcal{F}_{\alpha'}$.

Corollary 28 (cf. Corollary 10). Consider a probability space $(\Omega, \mu)$ and a filtered A-lattice $\{\mathcal{F}_\alpha\}$. Let
$\mathcal{I}$ be an instance of General A-CSP($\mathcal{F}_\alpha$, $\alpha_0$) and let $\Phi$ be its objective function. Then for every subset
$S \subseteq \{1, \dots, n\}$ of size at most $k$, $\Phi_S \in_R \mathcal{F}_{\alpha'}$; for every subset $S$ of size greater than $k$, $\Phi_S = 0$.
Furthermore, the Efron–Stein decomposition of $\Phi$ can be computed in time $O(m)$.

8.5 Compactness Properties of Filtered A-Lattices 

We now prove counterparts of Claim 11 and Lemma 14 for General A-CSP.

Claim 29 (cf. Claim 11). Consider a probability space $(\Omega, \mu)$. Let $\mathcal{F}$ be an A-lattice of functions of arity $k$
on $\Omega$. There exists a positive number $\beta$ such that for every non-zero function $f \in \mathcal{F}$,
$\mathbb{E}[f^2] = \mathbb{E}[f(X_1, \dots, X_k)^2] \ge \beta$.

Proof. Let $f_1, \dots, f_r$ be the basis of the lattice $\mathcal{F}$. Consider the linear span $Q$ of the functions $f_1, \dots, f_r$ (the
set of all linear combinations with real coefficients). The vector space $Q$ is finite-dimensional. Let $Q_1 =
\{\sum_i a_i f_i : \max_i |a_i| = 1\}$. Note that $Q_1$ is a compact set. All functions in $Q_1$ are non-zero (since $f_1, \dots, f_r$
are linearly independent), and, therefore, $\mathbb{E}[g^2] > 0$ for every $g \in Q_1$. Since $Q_1$ is compact, $\min_{g \in Q_1} \mathbb{E}[g^2] =
\inf_{g \in Q_1} \mathbb{E}[g^2] > 0$. Denote $\beta = \min_{g \in Q_1} \mathbb{E}[g^2]$.

Now consider a non-zero function $f \in \mathcal{F}$. Write $f = \sum_i a_i f_i$. Let $M = \max_i |a_i|$. Since $f \ne 0$ and
all coefficients $a_i$ are integer, $M \ge 1$. Note that $f/M \in Q_1$. We have

$$\mathbb{E}[f^2] = M^2\, \mathbb{E}[(f/M)^2] \ge \beta M^2 \ge \beta,$$

as required. □
Lemma 30 (cf. Lemma 14). Consider a probability space $(\Omega, \mu)$. Let $\mathcal{F}$ be an A-lattice of functions of arity
$k$ on $\Omega$. There exists a constant $C$ such that the following holds. Let $f$ be a function of arity at most $k$,
which depends on a subset of variables $\{x_1, \dots, x_n\}$. Assume that $f \in_R \mathcal{F}$ (see Definition 21). Then

$$\mathbb{E}[f^4] \le C\, \mathbb{E}[f^2]^2.$$

Proof. Since the function $f$ depends on at most $k$ variables among $x_1, \dots, x_n$, we may assume without loss of
generality that it depends on a subset of $x_1, \dots, x_k$. Then $f \in \mathcal{F}$. Let $Q$ be the linear span of $\mathcal{F}$. Note that
$Q$ is a finite-dimensional vector space of functions.

Define $Q_1 = \{h \in Q : \|h\|_2 = 1\}$. Since $Q_1$ is a compact set, the continuous function

$$W(g) = \mathbb{E}[g^4]$$

is bounded when $g \in Q_1$. Denote its maximum by $C$.

Letting $g = f/\|f\|_2$, we have

$$\mathbb{E}[f^4] = \mathbb{E}[g^4] \cdot \|f\|_2^4 \le C \|f\|_2^4,$$

as required. □

8.6 Variance of A-CSP Objective 

We now prove a counterpart of Theorem 13 for General A-CSP($\mathcal{F}_\alpha$, $k$, $\alpha$).

Lemma 31 (cf. Theorem 13). Consider a probability space $(\Omega, \mu)$. Let $\{\mathcal{F}_\alpha\}$ be a filtered A-lattice and $\alpha_0$
be an integer. There exists a number $\beta > 0$, which depends only on $\{\mathcal{F}_\alpha\}$ and $\alpha_0$, such that the following
holds. Let $\mathcal{I}$ be an instance of General A-CSP($\mathcal{F}_\alpha$, $\alpha_0$) and $\sigma$ be a parameter. Either $\mathcal{I}$ has a kernel on at
most $(k/\beta)\sigma^2$ variables or $\mathrm{Var}[\Phi] \ge \sigma^2$. Moreover, there is an algorithm that either finds a kernel on at
most $(k/\beta)\sigma^2$ variables or certifies that $\mathrm{Var}[\Phi] \ge \sigma^2$. The algorithm runs in time $O(m + n)$, where $n$ is
the number of variables and $m$ is the number of constraints.

Proof. Let $\alpha'$ be as in Lemma 27 (applied with $\alpha = \alpha_0$). Since $\mathcal{F}_{\alpha'}$ is an A-lattice, by Claim 29 there exists $\beta > 0$ such that
$\|f\|_2^2 \ge \beta$ for every non-zero $f \in \mathcal{F}_{\alpha'}$. Note that $\beta$ does not depend on $n$ and $t$.

Consider the Efron–Stein decomposition of $\Phi$. Let $V' = \bigcup_{S : \Phi_S \ne 0} \{x_i : i \in S\}$. The function $\Phi$ depends
only on variables in $V'$. Therefore, the restriction of $\mathcal{I}$ to variables in $V'$ is a kernel for $\mathcal{I}$. Let $v = |V'|$.

If $v \le (k/\beta)\sigma^2$, then we are done. So let us assume that $v > (k/\beta)\sigma^2$. There are at least $v/k$ non-empty sets
$S$ with $\Phi_S \ne 0$, since each such set $S$ contributes at most $k$ variables to $V'$. Note that $\mathbb{E}[\Phi_S] = 0$ for $S \ne \emptyset$
and hence $\mathrm{Var}[\Phi_S] = \mathbb{E}[\Phi_S^2]$. Since $\Phi_S \in_R \mathcal{F}_{\alpha'}$, $\mathrm{Var}[\Phi_S] = \mathbb{E}[\Phi_S^2] \ge \beta$ if $\Phi_S \ne 0$ and $S \ne \emptyset$. We have

$$\mathrm{Var}[\Phi] = \sum_S \mathrm{Var}[\Phi_S] \ge |\{S \ne \emptyset : \Phi_S \ne 0\}| \cdot \beta \ge (v/k)\beta > \sigma^2,$$

as required.

Note that we can compute the Efron–Stein decomposition of $\Phi$ in time $O(m + n)$ and then find the set
$V'$ in time $O(m + n)$. If $|V'| \le (k/\beta)\sigma^2$, we output the restriction of $\mathcal{I}$ to $V'$ (which we compute in time
$O(m + n)$). Otherwise, we output that $\mathrm{Var}[\Phi] \ge \sigma^2$. □
8.7 Proof of Theorem 26

We are ready to prove Theorem 26.

Proof. Let $\beta$ be as in Lemma 31 and $C$ be as in Lemma 30. Let $\sigma^2 = 4 \cdot 81^k \cdot C t^2$. Denote

$$f = \Phi - \mathbb{E}[\Phi] = \Phi - \mathrm{AVG}.$$

Note that $\operatorname{ess\,sup}_{x_1, \dots, x_n \in \Omega} f(x_1, \dots, x_n) = \mathrm{OPT} - \mathrm{AVG}$.

By Lemma 31, either $\mathcal{I}$ has a kernel on at most $(k/\beta)\sigma^2 = (4 \cdot 81^k \cdot kC/\beta)t^2$ variables or $\mathrm{Var}[f] \ge \sigma^2$.
In the former case, we output the kernel, and we are done. In the latter case, we show that OPT − AVG ≥
t. Assume that $\mathrm{Var}[f] \ge \sigma^2$, that is, $\|f\|_2 \ge \sigma$. By Theorem 4 and Lemma 30, $\|f\|_4^4 \le 81^k C \|f\|_2^4$. By Theorem 15,
$\Pr(f \ge \|f\|_2/(2 \cdot 9^k \sqrt{C})) > 0$, and hence $\mathrm{OPT} - \mathrm{AVG} \ge \|f\|_2/(2 \cdot 9^k \sqrt{C}) \ge \sigma/(2 \cdot 9^k \sqrt{C}) = t$.

The algorithm only executes the algorithm from Lemma 31, so its running time is $O(m + n)$. □
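The choice of $\sigma$ in the proof is exactly the one that makes the bound of Theorem 15 equal to $t$. The following sketch checks this arithmetic numerically and evaluates the resulting kernel size $(k/\beta)\sigma^2$; the function names and the sample values of $k$, $C$, $\beta$ and $t$ are purely illustrative assumptions.

```python
from math import isclose, sqrt


def sigma_for(k, C, t):
    """sigma with sigma^2 = 4 * 81^k * C * t^2, as chosen in the proof."""
    return sqrt(4 * 81**k * C) * t


def guaranteed_gain(k, C, sigma):
    """The bound sigma / (2 * 9^k * sqrt(C)) obtained with b = 81^k * C."""
    return sigma / (2 * 9**k * sqrt(C))


def kernel_bound(k, C, beta, t):
    """Kernel size bound (k / beta) * sigma^2 = (4 * 81^k * k * C / beta) * t^2."""
    return (k / beta) * sigma_for(k, C, t) ** 2
```

For any positive inputs, `guaranteed_gain(k, C, sigma_for(k, C, t))` returns $t$ up to floating-point error, which is the identity the proof relies on.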


9 Piecewise Polynomial Predicates 

In this section, we present an interesting example of a filtered A-lattice, the set of piecewise polynomial 
functions. As a corollary, we get that the problem of maximizing the objective over average for a CSP with 
piecewise polynomial functions is fixed-parameter tractable. 

Definition 32. Let us say that a subset $P$ of $[-1, 1]^k$ is $b$-polyhedral if it is defined by a set of linear
inequalities on $x_1, \dots, x_k$, in which all coefficients are bounded by $b$ in absolute value. In other words, $P$ is
a $b$-polyhedral set if for some $t$ there exist a $t \times k$ matrix $A$ and a vector $c$ (with $t$ coordinates) such that
$P = \{x : Ax < c\}$ (here, the inequality $Ax < c$ is understood coordinate-wise), and every entry of $A$ and
coordinate of $c$ is bounded by $b$ in absolute value. We denote the indicator function of a polyhedral set $P$ by
$I_P$.

Definition 33. We denote the set of polynomials $f(x_1, \dots, x_k)$ with real coefficients of degree at most $d$ by
$\mathbb{R}_{\le d}[x_1, \dots, x_k]$; we denote the set of polynomials $f(x_1, \dots, x_k)$ with integer coefficients of degree at most
$d$ by $\mathbb{Z}_{\le d}[x_1, \dots, x_k]$.

Definition 34. We say that a function $f(x_1, \dots, x_k) : [-1, 1]^k \to \mathbb{R}$ is piecewise polynomial on polyhedral
sets or $(d, b)$-PPP if $f$ is the sum of terms of the form $g(x_1, \dots, x_k) I_P(x_1, \dots, x_k)$, where $g \in \mathbb{Z}_{\le d}[x_1, \dots, x_k]$
and $P$ is a $b$-polyhedral set.

We note that every $(d, b)$-PPP function can be written in the following “canonical form”. Consider
all hyperplanes in $\mathbb{R}^k$ of the form $\langle a, x \rangle = c$, in which $a \in \{-b, \dots, b\}^k$ and $c \in \{-b, \dots, b\}$. They partition $[-1, 1]^k$
into polyhedrons. We call these polyhedrons elementary polyhedrons and denote the set of all elementary
polyhedrons by $\mathcal{P}_{elem}$. Note that each $b$-polyhedral set is a union of elementary polyhedrons. Thus we can
write every $(d, b)$-PPP function $f$ as follows:

$$f(x_1, \dots, x_k) = \sum_{P \in \mathcal{P}_{elem}} I_P(x_1, \dots, x_k)\, g_P(x_1, \dots, x_k), \qquad (13)$$

where $g_P \in \mathbb{Z}_{\le d}[x_1, \dots, x_k]$.
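To illustrate Definitions 32 and 34, the sketch below evaluates a PPP function given as a list of terms $g \cdot I_P$, with each polyhedral set stored as a pair $(A, c)$ describing $\{x : Ax < c\}$. The representation, the function names, and the Betweenness example encoded as a $(0, 1)$-PPP function are our own assumptions for the example.

```python
def in_polyhedron(A, c, x):
    """True if A x < c holds coordinate-wise (strict inequalities)."""
    return all(sum(a * xi for a, xi in zip(row, x)) < ci for row, ci in zip(A, c))


def eval_ppp(terms, x):
    """Evaluate a PPP function at x.

    terms: list of (g, A, c), where g is a callable polynomial and (A, c)
    describes the polyhedral set {x : A x < c}.
    """
    return sum(g(*x) for g, A, c in terms if in_polyhedron(A, c, x))


def one(*_):
    """The constant polynomial 1 (degree 0)."""
    return 1


# Hypothetical example: Betweenness, (x1 < x2 < x3) or (x3 < x2 < x1),
# written with coefficients bounded by 1 in absolute value.
BETWEENNESS = [
    (one, [[1, -1, 0], [0, 1, -1]], [0, 0]),   # x1 - x2 < 0 and x2 - x3 < 0
    (one, [[-1, 1, 0], [0, -1, 1]], [0, 0]),   # x2 - x1 < 0 and x3 - x2 < 0
]
```

The two polyhedral sets are disjoint, so the sum takes only the values 0 and 1 and coincides with the Betweenness predicate away from ties.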
Theorem 35. Let $\Omega = [-1, 1]$ and $\mu$ be the uniform measure on $[-1, 1]$. Let

$$\mathcal{F}_\alpha = \{f : (\alpha!\, f) \text{ is an } (\alpha, \alpha)\text{-PPP function of variables } x_1, \dots, x_k\}.$$

Then $\{\mathcal{F}_\alpha\}$ is a filtered A-lattice of functions.

Proof. First, we prove that each set $\mathcal{F}_\alpha$ is an A-lattice. It follows from (13) that $\mathcal{F}_\alpha$ is a lattice with basis
$I_P(x_1, \dots, x_k)\, g(x_1, \dots, x_k)/\alpha!$, where $P \in \mathcal{P}_{elem}$ and $g(x_1, \dots, x_k)$ is a monomial of degree at most $\alpha$
(i.e., $g$ is of the form $x_1^{r_1} \cdots x_k^{r_k}$). Since every monomial $g$ is bounded on $[-1, 1]^k$, every basis function is
bounded, and, therefore, all functions in $\mathcal{F}_\alpha$ are bounded. The definition of $\mathcal{F}_\alpha$ is symmetric with respect to
$x_1, \dots, x_k$, hence if we permute the arguments of any function $f \in \mathcal{F}_\alpha$, we get a function in $\mathcal{F}_\alpha$.

Now we show that $\{\mathcal{F}_\alpha\}$ is a filtered A-lattice. The inclusion $\mathcal{F}_\alpha \subseteq \mathcal{F}_{\alpha'}$ for $\alpha \le \alpha'$ is immediate. It
remains to show that for every $\alpha$ there exists $\alpha'$ such that $A_i$ maps $\mathcal{F}_\alpha$ to $\mathcal{F}_{\alpha'}$. Let $a$ = and $\alpha'$ =
(we note that, in fact, we can choose a much smaller value of $\alpha'$; however, we use this value to simplify the
exposition). Observe that all integer numbers between 1 and $\alpha$ divide $a$.

It is sufficient to prove that $A_i$ sends every basis function

$$f(x_1, \dots, x_k) = I_P(x_1, \dots, x_k)\, g(x_1, \dots, x_k)/\alpha!$$

to $\mathcal{F}_{\alpha'}$. Moreover, since the set of functions $\mathcal{F}_\alpha$ is invariant under permutation of function arguments, we
may assume without loss of generality that $i = k$. Denote $g = x_1^{r_1} \cdots x_k^{r_k}$, where $r_1 + \dots + r_k \le \alpha$. Consider
the set of linear inequalities $L$ that define the polyhedron $P$. All coefficients in each of the inequalities are
bounded by $\alpha$ in absolute value. Let $L_0$ be those inequalities that do not depend on $x_k$ and $L_1$ be those
that do depend on $x_k$. We rewrite every inequality in $L_1$ as follows. Consider an inequality in $L_1$. Let
$\lambda$ be the coefficient of $x_k$ in it. We multiply the inequality by $a/\lambda \in \mathbb{Z}$, and if $\lambda < 0$, we change the
comparison sign in the inequality to the opposite. Finally, we move all terms in the inequality other than
$a x_k$ to the right hand side. We get an equivalent inequality of the form either $a x_k > l(1, x_1, \dots, x_{k-1})$ or
$a x_k < u(1, x_1, \dots, x_{k-1})$, where $l$ and $u$ are linear functions with integer coefficients bounded by $a\alpha$ in
absolute value. Denote the inequalities of the form $a x_k > l(1, x_1, \dots, x_{k-1})$ by

$$a x_k > l_1(1, x_1, \dots, x_{k-1}),\ a x_k > l_2(1, x_1, \dots, x_{k-1}),\ \dots,\ a x_k > l_p(1, x_1, \dots, x_{k-1})$$

and the inequalities of the form $a x_k < u(1, x_1, \dots, x_{k-1})$ by

$$a x_k < u_1(1, x_1, \dots, x_{k-1}),\ a x_k < u_2(1, x_1, \dots, x_{k-1}),\ \dots,\ a x_k < u_q(1, x_1, \dots, x_{k-1}).$$

Let $M_0$ be the set of points $x = (x_1, \dots, x_k)$ such that $l_{j'}(1, x_1, \dots, x_{k-1}) = l_{j''}(1, x_1, \dots, x_{k-1})$ or
$u_{j'}(1, x_1, \dots, x_{k-1}) = u_{j''}(1, x_1, \dots, x_{k-1})$ for some $j' \ne j''$. Note that $M_0$ has measure 0.

Define $pq$ polyhedrons $P_{j_1 j_2}$. For $j_1 \in \{1, \dots, p\}$ and $j_2 \in \{1, \dots, q\}$, let $P_{j_1 j_2}$ be the
polyhedron defined by the following inequalities:

1. all inequalities in $L_0$,

2. inequality $l_{j_1}(1, x_1, \dots, x_{k-1}) < u_{j_2}(1, x_1, \dots, x_{k-1})$,

3. inequalities $l_j(1, x_1, \dots, x_{k-1}) < l_{j_1}(1, x_1, \dots, x_{k-1})$ for every $j \ne j_1$,

4. inequalities $u_j(1, x_1, \dots, x_{k-1}) > u_{j_2}(1, x_1, \dots, x_{k-1})$ for every $j \ne j_2$.
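Inequalities in items 1–4 are equivalent to the following condition (except for points in $M_0$):

$$\max_j l_j(1, x_1, \dots, x_{k-1}) = l_{j_1}(1, x_1, \dots, x_{k-1}) < u_{j_2}(1, x_1, \dots, x_{k-1}) = \min_j u_j(1, x_1, \dots, x_{k-1}). \qquad (14)$$

Note that if $(x_1, \dots, x_k) \in P \setminus M_0$ then $(x_1, \dots, x_{k-1}) \in P_{j_1 j_2}$ for

$$j_1 = \arg\max_j l_j(1, x_1, \dots, x_{k-1}) \quad\text{and}\quad j_2 = \arg\min_j u_j(1, x_1, \dots, x_{k-1}).$$

Also note that the polyhedrons $P_{j_1 j_2}$ are disjoint. Now let

$$h_{j_1 j_2}(x_1, \dots, x_{k-1}) = \frac{\big(u_{j_2}(1, x_1, \dots, x_{k-1})^{r_k+1} - l_{j_1}(1, x_1, \dots, x_{k-1})^{r_k+1}\big)\, x_1^{r_1} \cdots x_{k-1}^{r_{k-1}}}{2\, \alpha!\, a^{r_k+1} (r_k + 1)}.$$

Let $x' = (x_1, \dots, x_{k-1})$ be a point in $[-1, 1]^{k-1}$. Consider two cases.

Case 1. First, assume that $x' \in P_{j_1 j_2}$ for some $j_1$ and $j_2$. Then

$$A_k f(x') = \frac{1}{2\alpha!} \int_{-1}^{1} I_P(x_1, \dots, x_k)\, g(x_1, \dots, x_k)\, dx_k.$$

The point $(x_1, \dots, x_k)$ satisfies all inequalities in $L_0$ since $x' \in P_{j_1 j_2}$. Hence, $I_P(x_1, \dots, x_k) = 1$ if and
only if it satisfies all inequalities in $L_1$, which are equivalent to

$$\max_j l_j(1, x_1, \dots, x_{k-1}) < a x_k < \min_j u_j(1, x_1, \dots, x_{k-1}).$$

Combining this with (14), we get that $I_P(x_1, \dots, x_k) = 1$ if and only if

$$x_k \in \big(l_{j_1}(1, x_1, \dots, x_{k-1})/a,\ u_{j_2}(1, x_1, \dots, x_{k-1})/a\big).$$

Therefore,

$$A_k f(x') = \frac{1}{2\alpha!} \int_{l_{j_1}(1, x_1, \dots, x_{k-1})/a}^{u_{j_2}(1, x_1, \dots, x_{k-1})/a} x_1^{r_1} \cdots x_k^{r_k}\, dx_k$$

$$= \frac{\big(u_{j_2}(1, x_1, \dots, x_{k-1})^{r_k+1} - l_{j_1}(1, x_1, \dots, x_{k-1})^{r_k+1}\big)\, x_1^{r_1} \cdots x_{k-1}^{r_{k-1}}}{2\, \alpha!\, a^{r_k+1} (r_k + 1)} = h_{j_1 j_2}(x').$$

Case 2. Now assume that $x' \notin P_{j_1 j_2}$ for every $j_1$ and $j_2$. Then there is no $x_k$ such that $(x_1, \dots, x_k) \in
P \setminus M_0$. Therefore,

$$A_k f(x') = \frac{1}{2\alpha!} \int_{-1}^{1} I_P(x_1, \dots, x_k)\, g(x_1, \dots, x_k)\, dx_k = \frac{1}{2\alpha!} \int_{-1}^{1} 0 \cdot g(x_1, \dots, x_k)\, dx_k = 0.$$

(The equality holds on a set of full measure; see Remark 25.)

We conclude that

$$A_k f(x') = \sum_{j_1, j_2} I_{P_{j_1 j_2}}(x')\, h_{j_1 j_2}(x').$$

All coefficients in the inequalities that define $P_{j_1 j_2}$ are bounded by $2a\alpha$ in absolute value, and
$2\alpha!\, a^{r_k+1}(r_k + 1)\, h_{j_1 j_2}(x') \in \mathbb{Z}_{\le \alpha+1}[x_1, \dots, x_{k-1}]$. Therefore, $2\alpha!\, a^{r_k+1}(r_k + 1)\, A_k f$ is an
$(\alpha + 1, 2a\alpha)$-PPP function. Thus $A_k f \in \mathcal{F}_{\alpha'}$. □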
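The closed form obtained in Case 1 can be sanity-checked numerically on a toy instance. In the sketch below we take $k = 2$ and the function $I_P \cdot x_2$ with $P = \{x_1 < x_2\}$ inside $[-1, 1]^2$, so that $l = x_1$, $u = 1$, $a = 1$ and $r_k = 1$; for simplicity the normalization by $\alpha!$ is omitted. The example, the midpoint-rule integrator and all names are assumptions of this illustration.

```python
def average_last_numeric(f, x_prefix, steps=20000):
    """Midpoint-rule approximation of (1/2) * integral over [-1, 1] of f(x', x_k) dx_k."""
    width = 2.0 / steps
    total = 0.0
    for j in range(steps):
        xk = -1.0 + (j + 0.5) * width
        total += f(*x_prefix, xk)
    return 0.5 * total * width


def f_example(x1, x2):
    """Indicator of x1 < x2 times the monomial x2 (so r_1 = 0 and r_2 = 1)."""
    return x2 if x1 < x2 else 0.0


def closed_form(x1):
    """(u^2 - l^2) / (2 * a^2 * (r_k + 1)) with l = x1, u = 1, a = 1, r_k = 1."""
    return (1.0 - x1 * x1) / 4.0
```

The numerical average and the closed form agree up to the discretization error of the midpoint rule, and the result is again a polynomial of degree one higher in the remaining variable.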
From Theorems 26 and 35 we get the following corollary.

Corollary 36. For every $k$, $d$ and $b$, there is an algorithm that given an instance of a constraint satisfaction
problem on $n$ variables $x_1, \dots, x_n$ with $m$ real-valued constraints, each of which is a $(d, b)$-PPP function of
arity $k$, and a parameter $t$, either finds a kernel on at most $\kappa t^2$ variables or certifies that OPT ≥ AVG + t.
The algorithm runs in time $O(m + n)$. (The coefficient $\kappa$ and the coefficient in the $O$-notation depend only
on $k$, $d$, and $b$.)

Proof. Let $\alpha = \max(d, b)$. We apply Theorem 26 to the filtered A-lattice $\{\mathcal{F}_\alpha\}$ from Theorem 35 and get the
corollary. □


Since every constraint in a $(k, b)$-LP CSP problem is a $(0, b)$-PPP function of arity $k$ (see Definition 5),
we get the following corollary.

Corollary 37. For every $k$ and $b$, there is an algorithm that given an instance of $(k, b)$-LP CSP either finds
a kernel on at most $\kappa t^2$ variables or certifies that OPT ≥ AVG + t. The algorithm runs in time $O(m + n)$.
(The coefficient $\kappa$ and the coefficient in the $O$-notation depend only on $k$ and $b$.)

Remark 38. Note that for an instance of $(k, b)$-LP CSP, we have

$$\mathrm{OPT} = \max_{x_1, \dots, x_n \in [-1, 1]} \Phi(x_1, \dots, x_n) = \operatorname{ess\,sup}_{x_1, \dots, x_n \in [-1, 1]} \Phi(x_1, \dots, x_n),$$

since all LP constraints are strict. If we were to use non-strict “less-than-or-equal-to” and
“greater-than-or-equal-to” LP constraints, we would have to define OPT as $\operatorname{ess\,sup}_{x_1, \dots, x_n \in [-1, 1]} \Phi(x_1, \dots, x_n)$, and
not as $\max_{x_1, \dots, x_n \in [-1, 1]} \Phi(x_1, \dots, x_n)$, since, in general, $\operatorname{ess\,sup}_{x_1, \dots, x_n \in [-1, 1]} \Phi(x_1, \dots, x_n)$ might not be
equal to $\max_{x_1, \dots, x_n \in [-1, 1]} \Phi(x_1, \dots, x_n)$. For example, consider an instance of $(2, 1)$-LP CSP with two
constraints $x_1 \le x_2$ and $x_2 \le x_1$; we have

$$\mathrm{OPT} = \operatorname{ess\,sup}_{x_1, \dots, x_n \in [-1, 1]} \Phi(x_1, \dots, x_n) = 1,$$

but

$$\max_{x_1, \dots, x_n \in [-1, 1]} \Phi(x_1, \dots, x_n) = 2.$$

(The maximum is attained on a set of measure 0, where $x_1 = x_2$.)
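The example in Remark 38 is easy to reproduce. The sketch below evaluates the objective with the two non-strict constraints and also takes the maximum over random samples from $[-1, 1]^2$; the names `phi` and `sampled_max`, the sample size and the seed are arbitrary choices for the illustration.

```python
import random


def phi(x1, x2):
    """Objective with the two non-strict constraints x1 <= x2 and x2 <= x1."""
    return int(x1 <= x2) + int(x2 <= x1)


def sampled_max(samples=10000, seed=0):
    """Maximum of phi over independent uniform samples from [-1, 1]^2."""
    rng = random.Random(seed)
    return max(phi(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(samples))
```

The value 2 is attained only on the diagonal $x_1 = x_2$, which random sampling hits with probability 0 in the continuous model; this is why the essential supremum, and not the maximum, is the right notion of OPT for non-strict constraints.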
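A Rankwise independent permutations

In this section, we prove the following lemma.

Lemma 39. If $\tilde\alpha$ is a random $4k$-rankwise independent permutation and $|V'| \ge \kappa_k t^2$, then for some $\alpha^*$ in
the support of $\tilde\alpha$, $\mathrm{val}_{\mathcal{I}}(\alpha^*) \ge \mathrm{AVG} + t$.

Proof. Let $\alpha$ be a permutation uniformly distributed among all $n!$ permutations. The random variables
$\mathrm{val}_{\mathcal{I}}(\alpha)$ and $\Phi$ are identically distributed. Hence,

$$\mathrm{Var}[\mathrm{val}_{\mathcal{I}}(\alpha)] = \mathrm{Var}[\Phi], \quad\text{and}\quad \|\mathrm{val}_{\mathcal{I}}(\alpha) - \mathrm{AVG}\|_4 = \|\Phi - \mathrm{AVG}\|_4.$$

Observe that

$$\mathrm{Var}[\mathrm{val}_{\mathcal{I}}(\tilde\alpha)] = \mathrm{Var}[\mathrm{val}_{\mathcal{I}}(\alpha)], \quad\text{and}\quad \|\mathrm{val}_{\mathcal{I}}(\tilde\alpha) - \mathrm{AVG}\|_4 = \|\mathrm{val}_{\mathcal{I}}(\alpha) - \mathrm{AVG}\|_4,$$

since each predicate depends on at most $k$ variables, and hence for every four predicates $\pi_1, \pi_2, \pi_3, \pi_4 \in \Pi$, we have

$$\mathbb{E}[\pi_1(\tilde\alpha)\pi_2(\tilde\alpha)] = \mathbb{E}[\pi_1(\alpha)\pi_2(\alpha)];$$

$$\mathbb{E}[\pi_1(\tilde\alpha)\pi_2(\tilde\alpha)\pi_3(\tilde\alpha)\pi_4(\tilde\alpha)] = \mathbb{E}[\pi_1(\alpha)\pi_2(\alpha)\pi_3(\alpha)\pi_4(\alpha)].$$

Hence, as in the proof of Theorem 8, $\mathrm{Var}[\mathrm{val}_{\mathcal{I}}(\tilde\alpha)] \ge |V'|\beta_k/k$ and $\|\mathrm{val}_{\mathcal{I}}(\tilde\alpha) - \mathrm{AVG}\|_4^4 \le 81^k C_k \|\mathrm{val}_{\mathcal{I}}(\tilde\alpha) - \mathrm{AVG}\|_2^4$.
Consequently, by Theorem 15, $\Pr(\mathrm{val}_{\mathcal{I}}(\tilde\alpha) \ge \mathrm{AVG} + t) > 0$. This concludes the proof. □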
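The definition of rankwise independence can be tested by brute force on small supports. The following sketch checks whether the uniform distribution over a given list of permutations is $m$-rankwise independent; the representation of a permutation as a tuple of ranks and the name `is_rankwise_independent` are assumptions of this illustration, and the check is exponential in $m$, so it is meant only for toy sizes.

```python
from collections import Counter
from itertools import combinations
from math import factorial


def is_rankwise_independent(support, n, m):
    """Check m-rankwise independence of the uniform distribution on `support`.

    support: list of permutations, each a tuple of length n mapping an
    element to its rank. For every m-subset M, the order induced on M
    must be uniform over all m! possible orders.
    """
    for M in combinations(range(n), m):
        orders = Counter(tuple(sorted(M, key=lambda i: perm[i])) for perm in support)
        if len(orders) != factorial(m):
            return False          # some order of M never occurs
        if len(set(orders.values())) != 1:
            return False          # the orders are not equally likely
    return True
```

For instance, the two-element support consisting of the identity and its reverse is 2-rankwise independent but not 3-rankwise independent, which shows that small supports can already fool pairwise order tests.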